SEQUENCE LISTING 



<110> Wright, David A. 
Voytas, Daniel F. 

<120> Plant Retroelement s and Methods Related Thereto 

<130> P-1065 ISURF Plant Retroelement 

<140> unknown 
<141> 1999-05-28 

<150> 60/087125 
<151> 1998-05-29 

<160> 42 

<170> Patentln Ver. 2.0 

<210> 1 

<211> 18 

<212> DNA 

<213> Glycine max 

<400> 1 

tggcgccgtt gccaattg 

<210> 2 

<211> 18 

<212> DNA 

<213> Glycine max 

<400> 2 

tggcgccgtt gtcgggga 

<210> 3 

<211> 6 

<212> DNA 

<213> Glycine max 

<400> 3 
ttgggg 

<210> 4 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plant 
retroelement sequence 



<400> 4 

Met Ala Ser Arg Lys Arg Lys 
1 5 



<210> 5 
<211> 1263 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: plant 
retroelement sequence 



<400> 5 

atggcctccc gtaaacgcaa agctgtgccc acacccgggg aagcgtccaa ctgggactct 60 
tcacgtttca ctttcgagat tgcttggcac agataccagg atagcattca gctccggaac 120 
atccttccag agaggaatgt agagcttgga ccagggatgt ttgatgagtt cctgcaggaa 180 
ctccagaggc tcagatggga ccaggttctg acccgacttc cagagaagtg gattgatgtt 240 
gctctggtga aggagtttta ctccaaccta tatgatccag aggaccacag tccgaagttt 300 
tggagtgttc gaggacaggt tgtgagattt gatgctgaga cgattaatga tttcctcgac 360 
accccggtca tcttggcaga gggagaggat tatccagcct actctcagta cctcagcact 420 
cctccagacc atgatgccat cctttccgct ctgtgtactc cagggggacg atttgttctg 480 
aatgttgata gtgccccctg gaagctgctg cggaaggatc tgatgacgct cgcgcagaca 540 
tggagtgtgc tctcttattt taaccttgca ctgacttttc acacttctga tattaatgtt 600 
gacagggccc gactcaatta tggcttggtg atgaagatgg acctggacgt gggcagcctc 660 
atttctcttc agatcagtca gatcgcccag tccatcactt ccaggcttgg gttcccagcg 720 
ttgatcacaa cactgtgtga gattcagggg gttgtctctg ataccctgat ttttgagtca 780 
ctcagtcctg tgatcaacct tgcctacatt aagaagaact gctggaaccc tgccgatcca 840 
tctatcacat ttcaggggac ccgccgcacg cgcaccagag cttcggcgtc ggcatctgag 900 
gctcctcttc catcccagca tccttctcag cctttttccc agagaccacg gcctccactt 960 
ctatccacct cagcacctcc atacatgcat ggacagatgc tcaggtcctt gtaccagggt 1020 
cagcagatca tcattcagaa cctgtatcga ttgtccctac atttgcagat ggatctgcca 1080 
ctcatgactc cggaggccta tcgtcagcag gtcgccaagc taggagacca gccctccact 1140 
gacagggggg aagagccttc tggagccgct gctactgagg atcctgccgt tgatgaagac 1200 
ctcatagctg acttggctgg cgctgattgg agcccatggg cagacttggg cagaggcagc 1260 
tga 1263 



<210> 6 
<211> 421 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence; plant 
retroelement sequence 

<400> 6 

Met Ala Ser Arg Lys Arg Lys Ala Val Pro Thr Pro Gly Glu Ala Ser 
15 10 15 



2 



Asn Trp Asp Ser Ser Arg Phe Thr Phe Glu He Ala Trp His Arg Tyr 
20 25 30 



Gin Asp Ser He Gin Leu Arg Asn He Leu Pro Glu Arg Asn Val Glu 
35 40 45 

Leu Gly Pro Gly Met Phe Asp Glu Phe Leu Gin Glu Leu Gin Arg Leu 
50 55 60 

Arg Trp Asp Gin Val Leu Thr Arg Leu Pro Glu Lys Trp He Asp Val 
65 70 75 80 

Ala Leu Val Lys Glu Phe Tyr Ser Asn Leu Tyr Asp Pro Glu Asp His 
85 90 95 

Ser Pro Lys Phe Trp Ser Val Arg Gly Gin Val Val Arg Phe Asp Ala 
100 105 HO 

Glu Thr He Asn Asp Phe Leu Asp Thr Pro Val He Leu Ala Glu Gly 
115 120 125 

Glu Asp Tyr Pro Ala Tyr Ser Gin Tyr Leu Ser Thr Pro Pro Asp His 
130 135 140 

Asp Ala He Leu Ser Ala Leu Cys Thr Pro Gly Gly Arg Phe Val Leu 
i45 150 155 160 

Asn Val Asp Ser Ala Pro Trp Lys Leu Leu Arg Lys Asp Leu Met Thr 
165 170 175 

Leu Ala Gin Thr Trp Ser Val Leu Ser Tyr Phe Asn Leu Ala Leu Thr 
180 185 190 

Phe His Thr Ser Asp He Asn Val Asp Arg Ala Arg Leu Asn Tyr Gly 
195 200 205 

Leu Val Met Lys Met Asp Leu Asp Val Gly Ser Leu He Ser Leu Gin 
210 215 220 

lie Ser Gin He Ala Gin Ser He Thr Ser Arg Leu Gly Phe Pro Ala 
225 230 235 240 

Leu He Thr Thr Leu Cys Glu He Gin Gly Val Val Ser Asp Thr Leu 
245 250 255 

He Phe Glu Ser Leu Ser Pro Val He Asn Leu Ala Tyr He Lys Lys 
260 265 270 



Asn Cys Trp Asn Pro Ala Asp Pro Ser He Thr Phe Gin Gly Thr Arg 
275 280 285 



3 



Arg Thr Arg Thr Arg Ala 
290 

Ser Gin His Pro Ser Gin 
305 310 

Leu Ser Thr Ser Ala Pro 
325 

Leu Tyr Gin Gly Gin Gin 
340 

Leu His Leu Gin Met Asp 
355 

Gin Gin Val Ala Lys Leu 
370 

Glu Pro Ser Gly Ala Ala 
385 390 

Leu lie Ala Asp Leu Ala 
405 

Gly Arg Gly Ser Glx 
420 



<210> 7 
<211> 1596 
<212> DNA 
<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: plant 
retroelement sequence 

<400> 7 

atgcgaggta gaactgcatc tggagacgtt gttcctatta acttagaaat tgaagctacg 60 
tgtcggcgta acaacgctgc aagaagaaga agggagcaag acatagaagg aagtagttac 120 
acctcacctc ctccttctcc aaattatgct cagatggacg gggaaccggc acaaagagtc 180 
acactagagg acttctctaa taccaccact cctcagttct ttacaagtat cacaaggccg 240 
gaagtccaag cagatctcct tactcaaggg aacctcttcc atggtcttcc aaatgaagat 300 
ccatatgcgc atctagcctc atacatagag atatgcagca ccgttaaaat cgccggagtt 360 
ccaaaagatg cgatactcct taacctcttt tccttttccc tagcaggaga ggcaaaaaga 420 
tggttgcact cctttaaagg caatagctta agaacatggg aagaagtagt ggaaaaattc 480 
ttaaagaagt atttcccaga gtcaaagacc gtcgaacgaa agatggagat ttcttatttc 540 
catcaatttc tggatgaatc ccttagcgaa gcactagacc atttccacgg attgctaaga 600 
aaaacaccaa cacacagata cagcgagcca gtacaactaa acatattcat cgatgacttg 660 
caactcttaa tcgaaacagc tactagaggg aagatcaagc tgaagactcc cgaagaagcg 720 



Ser Ala Ser Ala Ser Glu Ala Pro Leu Pro 
295 300 

Pro Phe Ser Gin Arg Pro Arg Pro Pro Leu 
315 320 

Pro Tyr Met His Gly Gin Met Leu Arg Ser 
330 335 

lie lie lie Gin Asn Leu Tyr Arg Leu Ser 
345 350 

Leu Pro Leu Met Thr Pro Glu Ala Tyr Arg 
360 365 

Gly Asp Gin Pro Ser Thr Asp Arg Gly Glu 
375 380 

Ala Thr Glu Asp Pro Ala Val Asp Glu Asp 
395 400 

Gly Ala Asp Trp Ser Pro Trp Ala Asp Leu 
410 415 



4 



atggagctcg tcgagaacat ggcggctagc gatcaagcaa tccttcatga tcacacttat 780 
gttcccacaa aaagaagcct cttggagctt agcacgcagg acgcaacttt ggtacaaaac 840 
aagc'tgttga cgaggcagat agaagccctc atcgaaaccc tcagcaagct gcctcaacaa 900 
ttacaagcga taagttcttc ccactcttct gttttgcagg tagaagaatg ccccacatgc 960 
agagggacac atgagcctgg acaatgtgca agccaacaag acccctctcg tgaagtaaat 1020 
tatataggca tactaaatcg ttacggattt cagggctaca accagggaaa tccatctgga 1080 
ttcaatcaag gggcaacaag atttaatcac gagccaccgg ggtttaatca aggaagaaac 1140 
ttcatgcaag gctcaagttg gacgaataaa ggaaatcaat ataaggagca aaggaaccaa 1200 
ccaccatacc agccaccata ccagcaccct agccaaggtc cgaatcagca agaaaagccc 1260 
accaaaatag aggaactgct gctgcaattc atcaaggaga caagatcaca tcaaaagagc 1320 
acggatgcag ccattcggaa tctagaagtt caaatgggcc aactggcgca tgacaaagcc 1380 
gaacggccca ctagaacttt cggtgctaac atggagagaa gaaccccaag gaaggataaa 1440 
gcagtactga ctagagggca gagaagagcg caggaggagg gtaaggttga aggagaagac 1500 
tggccagaag aaggaaggac agagaagaca gaagaagaag agaaggtggc agaagaacct 1560 
aagcgtacca agagccagag agcaagggaa gccaag 1596 

<210> 8 
<211> 532 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plant 
retroelement sequence 

<400> 8 

Met Arg Gly Arg Thr Ala Ser Gly Asp Val Val Pro He Asn Leu Glu 
15 10 15 

He Glu Ala Thr Cys Arg Arg Asn Asn Ala Ala Arg Arg Arg Arg Glu 
20 25 30 

Gin Asp He Glu Gly Ser Ser Tyr Thr Ser Pro Pro Pro Ser Pro Asn 
35 40 45 

Tyr Ala Gin Met Asp Gly Glu Pro Ala Gin Arg Val Thr Leu Glu Asp 
50 55 60 

Phe Ser Asn Thr Thr Thr Pro Gin Phe Phe Thr Ser He Thr Arg Pro 
65 70 75 80 

Glu Val Gin Ala Asp Leu Leu Thr Gin Gly Asn Leu Phe His Gly Leu 
85 90 95 

Pro Asn Glu Asp Pro Tyr Ala His Leu Ala Ser Tyr He Glu He Cys 
100 105 HO 

Ser Thr Val Lys He Ala Gly Val Pro Lys Asp Ala He Leu Leu Asn 
115 120 125 

Leu Phe Ser Phe Ser Leu Ala Gly Glu Ala Lys Arg Trp Leu His Ser 



5 



130 



135 



140 



Phe Lys Gly Asn Ser Leu Arg Thr Trp Glu Glu Val Val Glu Lys Phe 
145 150 155 160 

Leu Lys Lys Tyr Phe Pro Glu Ser Lys Thr Val Glu Arg Lys Met Glu 
165 170 175 

lie Ser Tyr Phe His Gin Phe Leu Asp Glu Ser Leu Ser Glu Ala Leu 
180 185 190 

Asp His Phe His Gly Leu Leu Arg Lys Thr Pro Thr His Arg Tyr Ser 
195 200 205 

Glu Pro Val Gin Leu Asn He Phe He Asp Asp Leu Gin Leu Leu He 
210 215 220 

Glu Thr Ala Thr Arg Gly Lys He Lys Leu Lys Thr Pro Glu Glu Ala 
225 230 235 240 

Met Glu Leu Val Glu Asn Met Ala Ala Ser Asp Gin Ala He Leu His 
245 250 255 

Asp His Thr Tyr Val Pro Thr Lys Arg Ser Leu Leu Glu Leu Ser Thr 
260 265 270 

Gin Asp Ala Thr Leu Val Gin Asn Lys Leu Leu Thr Arg Gin He Glu 
27 5 280 285 

Ala Leu He Glu Thr Leu Ser Lys Leu Pro Gin Gin Leu Gin Ala He 
290 295 300 

Ser Ser Ser His Ser Ser Val Leu Gin Val Glu Glu Cys Pro Thr Cys 
305 310 315 320 

Arg Gly Thr His Glu Pro Gly Gin Cys Ala Ser Gin Gin Asp Pro Ser 
325 330 335 

Arg Glu Val Asn Tyr He Gly He Leu Asn Arg Tyr Gly Phe Gin Gly 
340 345 350 

Tyr Asn Gin Gly Asn Pro Ser Gly Phe Asn Gin Gly Ala Thr Arg Phe 
355 360 365 

Asn His Glu Pro Pro Gly Phe Asn Gin Gly Arg Asn Phe Met Gin Gly 
370 375 380 



Ser Ser Trp Thr Asn Lys Gly Asn Gin Tyr Lys Glu Gin Arg Asn Gin 
385 390 395 400 

Pro Pro Tyr Gin Pro Pro Tyr Gin His Pro Ser Gin Gly Pro Asn Gin 



6 



405 



410 



415 



Gin Glu Lys Pro Thr Lys lie Glu 
420 

Glu Thr Arg Ser His Gin Lys Ser 
435 440 

Glu Val Gin Met Gly Gin Leu Ala 
450 455 

Arg Thr Phe Gly Ala Asn Met Glu 
465 470 

Ala Val Leu Thr Arg Gly Gin Arg 
485 

Glu Gly Glu Asp Trp Pro Glu Glu 
500 

Glu Glu Lys Val Ala Glu Glu Pro 
515 520 

Arg Glu Ala Lys 
530 



Glu Leu Leu Leu Gin Phe lie Lys 
425 430 

Thr Asp Ala Ala lie Arg Asn Leu 
445 

His Asp Lys Ala Glu Arg Pro Thr 
460 

Arg Arg Thr Pro Arg Lys Asp Lys 
475 480 

Arg Ala Gin Glu Glu Gly Lys Val 
490 495 

Gly Arg Thr Glu Lys Thr Glu Glu 
505 510 

Lys Arg Thr Lys Ser Gin Arg Ala 
525 



<210> 9 
<211> 603 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plant 
retroelement sequence 

<400> 9 

tgtgataaat gccagagaac aggggggata tctcgaagaa atgagatgcc tttgcagaat 60 
atcatggaag tagagatctt tgactgttgg ggcatagact tcatggggcc ttttccttcg 120 
tcatacggga atgtctacat cttggtagct gtggattacg tctccaaatg ggtggaagcc 180 
atagccacgc caaaggacga tgccagggta gtgatcaaat ttctgaagaa gaacattttt 240 
tcccgttttg gagtcccacg agccttgatt agtgataggg gaacgcactt ctgcaacaat 300 
cagttgaaga aagtcctgga gcactataat gtccgacata aggtggccac accttatcac 360 
cctcagacaa atggccaagc agaaatttct aacagggagc tcaagcgaat cctggaaaag 420 
acagttgcat caacaagaaa ggattggtcc ttgaagctcg atgatgctct ctgggcctat 480 
aggacagcgt tcaagactcc catcggctta tcaccatttc agctagtgta tgggaaggca 540 
tgtcatttac cagtggagct ggagtacaaa gcatattggg ctctcaagtt gctcaacttt 600 
gac 603 

<210> 10 
<211> 201 



7 



<212> PRT 



<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plant 
retroelement sequence 

<400> 10 

Cys Asp Lys Cys Gin Arg Thr Gly Gly He Ser Arg Arg Asn Glu Met 
1 5 10 15 

Pro Leu Gin Asn He Met Glu Val Glu He Phe Asp Cys Trp Gly He 
20 25 30 

Asp Phe Met Gly Pro Phe Pro Ser Ser Tyr Gly Asn Val Tyr He Leu 
35 40 45 

Val Ala Val Asp Tyr Val Ser Lys Trp Val Glu Ala He Ala Thr Pro 
50 55 60 

Lys Asp Asp Ala Arg Val Val He Lys Phe Leu Lys Lys Asn He Phe 
65 70 75 80 

Ser Arg Phe Gly Val Pro Arg Ala Leu He Ser Asp Arg Gly Thr His 
85 90 95 

Phe Cys Asn Asn Gin Leu Lys Lys Val Leu Glu His Tyr Asn Val Arg 
100 105 HO 

His Lys Val Ala Thr Pro Tyr His Pro Gin Thr Asn Gly Gin Ala Glu 
115 120 125 

He Ser Asn Arg Glu Leu Lys Arg He Leu Glu Lys Thr Val Ala Ser 
130 135 140 

Thr Arg Lys Asp Trp Ser Leu Lys Leu Asp Asp Ala Leu Trp Ala Tyr 
145 150 155 160 

Arg Thr Ala Phe Lys Thr Pro He Gly Leu Ser Pro Phe Gin Leu Val 
165 170 175 

Tyr Gly Lys Ala Cys His Leu Pro Val Glu Leu Glu Tyr Lys Ala Tyr 
ISO 185 190 

Trp Ala Leu Lys Leu Leu Asn Phe Asp 
195 200 



<210> 11 
<211> 600 



8 



<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plant 
retroelement sequence 

<400> 11 

ttggaggctg ggctcatata ccccatctct gacagcgctt gggtaagccc agtacaggtg 60 
gttcccaaga aaggtggaat gacagtggta cgagatgaga ggaatgactt gataccaaca 120 
cgaactgtca ctggttggcg aatgtgtatc gactatcgca agctgaatga agccacacgg 180 
aaggaccatt tccccttacc tttcatggat cagatgctgg agagacttgc agggcaggca 240 
tactactgtt tcttggatgg atactcggga tacaaccaga tcgcggtaga ccccagagat 300 
caggagaaga cggcctttac atgccccttt ggcgtctttg cttacagaag gatgccattc 360 
gggttatgta atgcaccagc cacatttcag aggtgcatgc tggccatttt ttcagacatg 420 
gtggagaaaa gcatcgaggt atttatggac gacttctcgg tttttggacc ctcatttgac 480 
agctgtttga ggaacctaga gagggtactt cagaggtgcg aagagactaa cttggtactg 540 
aattgggaaa agtgtcattt catggttcga gagggcatag tcctaggcca caagatctca 600 

<210> 12 
<211> 200 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plant 
retroelement sequence 

<400> 12 

Leu Glu Ala Gly Leu lie Tyr Pro lie Ser Asp Ser Ala Trp Val Ser 
15 10 15 

Pro Val Gin Val Val Pro Lys Lys Gly Gly Met Thr Val Val Arg Asp 
20 25 30 

Glu Arg Asn Asp Leu lie Pro Thr Arg Thr Val Thr Gly Trp Arg Met 
35 40 45 

Cys He Asp Tyr Arg Lys Leu Asn Glu Ala Thr Arg Lys Asp His Phe 
50 55 60 

Pro Leu Pro Phe Met Asp Gin Met Leu Glu Arg Leu Ala Gly Gin Ala 
65 70 75 80 

Tyr Tyr Cys Phe Leu Asp Gly Tyr Ser Gly Tyr Asn Gin He Ala Val 
85 90 95 

Asp Pro Arg Asp Gin Glu Lys Thr Ala Phe Thr Cys Pro Phe Gly Val 
100 105 110 

Phe Ala Tyr Arg Arg Met Pro Phe Gly Leu Cys Asn Ala Pro Ala Thr 



9 



115 



120 



125 



Phe Gin Arg Cys Met Leu Ala lie 
130 135 

lie Glu Val Phe Met Asp Asp Phe 
145 150 

Ser Cys Leu Arg Asn Leu Glu Arg 
165 

Asn Leu Val Leu Asn Trp Glu Lys 
180 

lie Val Leu Gly His Lys lie Ser 

195 200 



Phe Ser Asp Met Val Glu Lys Ser 
140 

Ser Val Phe Gly Pro Ser Phe Asp 
155 160 

Val Leu Gin Arg Cys Glu Glu Thr 
170 175 

Cys His Phe Met Val Arg Glu Gly 
185 190 



<210> 13 
<211> 858 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plant 
retroelement sequence 

<400> 13 

aaggaagaac cactagccct tccacaggat ctcccatatc ctatggcacc caccaagaag 60 
aacaaggagc gttactttgc acgtttcttg gaaatattca aagggttaga aatcactatg 120 
ccattcgggg aagccttaca gcagatgccc ctctactcca aatttatgaa agacatcctc 180 
accaagaagg ggaagtatat tgacaacgag aatattgtgg taggaggcaa ttgcagtgcg 240 
ataatacaaa ggattctacc caagaagttt aaagaccccg gaagtgttac catcccgtgc 300 
accattggga aggaagccgt aaacaaggcc ctcattgatc taggagcaag tatcaatctg 360 
atgcccttgt caatgtgcaa aagaattggg aatttgaaga tagatcccac caagatgacg 420 
cttcaactgg cagaccgctc aatcacaagg ccatatgggg tggtagaaga tgtcctggtc 480 
aaggtacgcc acttcacttt tccggtggac tttgttatca tggatatcga agaagacact 540 
gagattcccc ttatcttagg cagacccttc atgctgactg ccaactgtgt ggtggatatg 600 
gggaaaggga acttagagtt gactattgat aatcagaaga tcacctttga ccttatcaag 660 
gcaatgaagt acccacagga gggttggaag tgcttcagaa tagaggagat tgatgaggaa 720 
gatgtcagtt ttctcgagac accaaagact tcgctagaaa aagcaatggt aaatcattta 780 
gactgtctaa ccagtgaaga ggaagaagat ctgaaggctt gcttggaaaa cttggatcaa 840 
gaagacagta ttcctgag 353 

<210> 14 
<211> 286 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plant 



10 



retroelement sequence 



<400> 14 

Lys Glu Glu Pro Leu Ala Leu Pro Gin Asp Leu Pro Tyr Pro Met Ala 
1 5 10 15 

Pro Thr Lys Lys Asn Lys Glu Arg Tyr Phe Ala Arg Phe Leu Glu lie 
20 25 30 

Phe Lys Gly Leu Glu lie Thr Met Pro Phe Gly Glu Ala Leu Gin Gin 
35 40 45 

Met Pro Leu Tyr Ser Lys Phe Met Lys Asp lie Leu Thr Lys Lys Gly 
50 55 60 

Lys Tyr He Asp Asn Glu Asn He Val Val Gly Gly Asn Cys Ser Ala 
65 70 75 80 

He He Gin Arg He Leu Pro Lys Lys Phe Lys Asp Pro Gly Ser Val 
85 90 95 

Thr He Pro Cys Thr He Gly Lys Glu Ala Val Asn Lys Ala Leu He 
100 105 HO 

Asp Leu Gly Ala Ser He Asn Leu Met Pro Leu Ser Met Cys Lys Arg 
115 120 125 

He Gly Asn Leu Lys He Asp Pro Thr Lys Met Thr Leu Gin Leu Ala 
130 135 140 

Asp Arg Ser He Thr Arg Pro Tyr Gly Val Val Glu Asp Val Leu Val 
145 150 155 160 

Lys Val Arg His Phe Thr Phe Pro Val Asp Phe Val He Met Asp He 
165 170 175 

Glu Glu Asp Thr Glu He Pro Leu He Leu Gly Arg Pro Phe Met Leu 
180 185 190 

Thr Ala Asn Cys Val Val Asp Met Gly Lys Gly Asn Leu Glu Leu Thr 
195 200 205 

He Asp Asn Gin Lys He Thr Phe Asp Leu He Lys Ala Met Lys Tyr 
210 215 220 

Pro Gin Glu Gly Trp Lys Cys Phe Arg He Glu Glu He Asp Glu Glu 

225 230 235 240 

Asp Val Ser Phe Leu Glu Thr Pro Lys Thr Ser Leu Glu Lys Ala Met 
245 250 255 



11 



Val Asn His Leu Asp Cys Leu Thr Ser Glu Glu Glu Glu Asp Leu Lys 
260 265 270 



Ala Cys Leu Glu Asn Leu Asp Gin Glu Asp Ser lie Pro Glu 
275 280 285 



<210> 15 
<211> 192 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plant 
retroelement sequence 

<400> 15 

tttgaactaa tgtgtgatgc cagtgattat gcagtaggag cagttttggg acagaggaaa 60 
gacaaggtat ttcacgccat ctattatgct agcaaggtcc tgaatgaagc acagttgaat 120 
tatgcaacca cagaaaagga gatgctagcc attgtctttg ccttggagaa gttcaggtca 180 
tacttgatag gg 192 

<210> 16 
<211> 64 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plant 
retroelement sequence 

<400> 16 

Phe Glu Leu Met Cys Asp Ala Ser Asp Tyr Ala Val Gly Ala Val Leu 
15 10 15 

Gly Gin Arg Lys Asp Lys Val Phe His Ala He Tyr Tyr Ala Ser Lys 
20 25 30 

Val Leu Asn Glu Ala Gin Leu Asn Tyr Ala Thr Thr Glu Lys Glu Met 
35 40 45 

Leu Ala He Val Phe Ala Leu Glu Lys Phe Arg Ser Tyr Leu He Gly 
50 55 60 



<210> 17 
<211> 12286 



12 



<212> DNA 



<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; plant 
retroelement sequence 

<400> 17 

tgataactgc taaataattg tgaattaata gtagaaaatt agtcaaattt tggcttaaaa 60 
ttaattattt agcagttatt tgtgattaaa agttagaaaa gcaattaagt tgaatttttg 120 
gccatagata tgaaaactga aggtacaaca agcaaaaggc agcagaaagt gaagaaaaag 180 
aataaaatct gaagcagacc cagcccaaca cgcgccctta gcgcgcgtca cgcgctaagc 240 
ttgcaaggca gcacaggcac taagcgaggc gttaagcacg aagatgcagg attcgttacg 300 
tgcgctaagc gcgaggcaca cgctaagcgc gcgatccaac agaagcacac gctaagcctg 360 
cagcatgcgc taagcgcgcc tacgaaggcc caaagcccat ttctacacct ataaatagag 420 
atccaagcca agggagaatg tacaccttgc ctcagagcac ttctctcagc attccaagct 480 
tgagctctcc cttttctctc tatattcttt gcttttatta tccattcttt ctttcacccc 540 
agttgtaaag cccctcaatg gccatgagtg gttaatcccc tagctacggc ctggtaggcc 600 
taaaaagcca atgatgtatg gtgtacttca agagttatca atgcaaagag gattcattcc 660 
aggttttatg ttctaattct ttccttttta tcttgcattt atgtcttaaa tttctgttgg 720 
gttttattcg ctcgggagag ggtatttcct aataagggtt taagaagtaa tgcatgcatc 780 
agttttaggg gttatacgct tggtaaaggg taacacctaa tagaacaaat taagaaaagg 840 
atcgtcgggc tagcattgct aggcatagaa tgatggccca atgcccatgc atttagcaac 900 
atctagaatt taaccttaat gcattttaat tattgaatct tcacaaaggc atttgggaga 960 
taggtagtta aaataggctt gtcatcgtga ggcatcaagg gcaagtaaaa ttaatagatg 1020 
tgggtagaac taattcaact gcattggtaa tgaacatcat aaattcattc atcgtaggcc 1080 
aattaggttt gtccggtctt ggcattttca tcaattgtct tcctaaatta tttgatctaa 1140 
tagcaacaat ttattcttat gcctattcct gtttttacta tttactttta cttacaaatt 1200 
gaagagtatt caataaagtg caataaaatc cctatggaaa cgatactcgg acttccgaga 1260 
attactactt agaacgattt ggtacacttg tcaaacacct caacaagttt ttggcgccgt 1320 
tgtcggggat tttgttctcg cacttaattg ccatactata ttagtttgta agcttaattc 1380 
ttcttttctt ggctcattct tttattattc tttactttac tttttcttct atcctttctt 1440 
tcttctccca taaattgcac gggtagtgcc tttttgtttt tatgcgaggt agaactgcat 1500 
ctggagacgt tgttcctatt aacttagaaa ttgaagctac gtgtcggcgt aacaacgctg 1560 
caagaagaag aagggagcaa gacatagaag gaagtagtta cacctcacct cctccttctc 1620 
caaattatgc tcagatggac ggggaaccgg cacaaagagt cacactagag gacttctcta 1680 
ataccaccac tcctcagttc tttacaagta tcacaaggcc ggaagtccaa gcagatctcc 1740 
ttactcaagg gaacctcttc catggtcttc caaatgaaga tccatatgcg catctagcct 1800 
catacataga gatatgcagc accgttaaaa tcgccggagt tccaaaagat gcgatactcc 1860 
ttaacctctt ttccttttcc ctagcaggag aggcaaaaag atggttgcac tcctttaaag 1920 
gcaatagctt aagaacatgg gaagaagtag tggaaaaatt cttaaagaag tatttcccag 1980 
agtcaaagac cgtcgaacga aagatggaga tttcttattt ccatcaattt ctggatgaat 2040 
cccttagcga agcactagac catttccacg gattgctaag aaaaacacca acacacagat 2100 
acagcgagcc agtacaacta aacatattca tcgatgactt gcaactctta atcgaaacag 2160 
ctactagagg gaagatcaag ctgaagactc ccgaagaagc gatggagctc gtcgagaaca 2220 
tggcggctag cgatcaagca atccttcatg atcacactta tgttcccaca aaaagaagcc 2280 
tcttggagct tagcacgcag gacgcaactt tggtacaaaa caagctgttg acgaggcaga 2340 
tagaagccct catcgaaacc ctcagcaagc tgcctcaaca attacaagcg ataagttctt 2400 
cccactcttc tgttttgcag gtagaagaat gccccacatg cagagggaca catgagcctg 2460 
gacaatgtgc aagccaacaa gacccctctc gtgaagtaaa ttatataggc atactaaatc 2520 
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gttacggatt tcagggctac aaccagggaa atccatctgg attcaatcaa ggggcaacaa 2580 
gatttaatca cgagccaccg gggtttaatc aaggaagaaa cttcatgcaa ggctcaagtt 2640 
ggacgaataa aggaaatcaa tataaggagc aaaggaacca accaccatac cagccaccat 2700 
accagcaccc tagccaaggt ccgaatcagc aagaaaagcc caccaaaata gaggaactgc 2760 
tgctgcaatt catcaaggag acaagatcac atcaaaagag cacggatgca gccattcgga 2820 
atctagaagt tcaaatgggc caactggcgc atgacaaagc cgaacggccc actagaactt 2880 
tcggtgctaa catggagaga agaaccccaa ggaaggataa agcagtactg actagagggc 2940 
agagaagagc gcaggaggag ggtaaggttg aaggagaaga ctggccagaa gaaggaagga 3000 
cagagaagac agaagaagaa gagaaggtgg cagaagaacc taagcgtacc aagagccaga 3060 
gagcaaggga agccaagaag gaagaaccac tagcccttcc acaggatctc ccatatccta 3120 
tggcacccac caagaagaac aaggagcgtt actttgcacg tttcttggaa atattcaaag 3180 
ggttagaaat cactatgcca ttcggggaag ccttacagca gatgcccctc tactccaaat 3240 
ttatgaaaga catcctcacc aagaagggga agtatattga caacgagaat attgtggtag 33 00 
gaggcaattg cagtgcgata atacaaagga ttctacccaa gaagtttaaa gaccccggaa 3360 
gtgttaccat cccgtgcacc attgggaagg aagccgtaaa caaggccctc attgatctag 3420 
gagcaagtat caatctgatg cccttgtcaa tgtgcaaaag aattgggaat ttgaagatag 3480 
atcccaccaa gatgacgctt caactggcag accgctcaat cacaaggcca tatggggtgg 3540 
tagaagatgt cctggtcaag gtacgccact tcacttttcc ggtggacttt gttatcatgg 3 600 
atatcgaaga agacactgag attcccctta tcttaggcag acccttcatg ctgactgcca 3660 
actgtgtggt ggatatgggg aaagggaact tagagttgac tattgataat cagaagatca 3720 
cctttgacct tatcaaggca atgaagtacc cacaggaggg ttggaagtgc ttcagaatag 3780 
aggagattga tgaggaagat gtcagttttc tcgagacacc aaagacttcg ctagaaaaag 3840 
caatggtaaa tcatttagac tgtctaacca gtgaagagga agaagatctg aaggcttgct 3900 
tggaaaactt ggatcaagaa gacagtattc ctgagggaga agccaatttc gaggagctag 3960 
agaaggaagt tccgtctgag aagccgaaga tagagttgaa gatattgcct gatcatctga 4020 
agtatgtgtt cttggaggaa gataaaccta tagtgatcag taacgcactc acaacagagg 4080 
aggaaaatag gttggtagat gtcctcaaga aacacaggga agcaattgga tggcacatat 4140 
cggatctcaa ggaaattagc cctgcttact gcatgcacag gataatgatg gaagaggact 4200 
acaagccagt ccgacaaccc cagaggcggc tgaatccaac aatgaaggaa gaggtaagaa 4260 
aggaggtact caagctcttg gaggctgggc tcatataccc catctctgac agcgcttggg 4320 
taagcccagt acaggtggtt cccaagaaag gtggaatgac agtggtacga gatgagagga 4380 
atgacttgat accaacacga actgtcactg gttggcgaat gtgtatcgac tatcgcaagc 4440 
tgaatgaagc cacacggaag gaccatttcc ccttaccttt catggatcag atgctggaga 4500 
gacttgcagg gcaggcatac tactgtttct tggatggata ctcgggatac aaccagatcg 4560 
cggtagaccc cagagatcag gagaagacgg cctttacatg cccctttggc gtctttgctt 4620 
acagaaggat gccattcggg ttatgtaatg caccagccac atttcagagg tgcatgctgg 4680 
ccattttttc agacatggtg gagaaaagca tcgaggtatt tatggacgac ttctcggttt 4740 
ttggaccctc atttgacagc tgtttgagga acctagagag ggtacttcag aggtgcgaag 4800 
agactaactt ggtactgaat tgggaaaagt gtcatttcat ggttcgagag ggcatagtcc 4860 
taggccacaa gatctcagcc agagggattg aggttgatcg ggcaaagata gacgtcatcg 4920 
agaagctgcc accaccactg aatgttaaag gggttagaag tttcttaggg catgcaggtt 4980 
tctacaggag gtttatcaag gacttctcga agattgccag gcccttaagc aatctgttga 5040 
ataaagacgt ggcttttgtg tttgatgaag aatgtttagc agcatttcaa tcactgaaga 5100 
ataagctcgt cactgcaccc gtaatgattg cacccgactg gaataaagat tttgaactaa 5160 
tgtgtgatgc cagtgattat gcagtaggag cagttttggg acagaggaaa gacaaggtat 5220 
ttcacgccat ctattatgct agcaaggtcc tgaatgaagc acagttgaat tatgcaacca 5280 
cagaaaagga gatgctagcc attgtctttg ccttggagaa gttcaggtca tacttgatag 5340 
ggtcgagggt catcatttac acagatcatg ctgccatcaa gcacctgctc gccaaaacag 5400 
actcaaagcc gaggttgatt agatgggtcc tgctgttaca agaatttgac atcatcatca 5460 
aggacaagaa aggatccgag aatgtggtag ccaatcatct atctcgatta aagaatgaag 5520 
aagtcaccaa ggaagaacca gaggtaaaag gtgaatttcc tgatgagttt cttttgcagg 5580 
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ttaccgaaag accttggttt gcagacatgg ctaactacaa agccacggga gtcattccag 5640 
aggagtttaa ttggagtcag aggaagaaat tcttgcacga tgcacgcttc tatgtgtggg 5700 
atgatcctca tttgttcaag gcaggagcag ataatttatt aaggagatgc gtcacaaagg 5760 
aggaagcacg gagcattctt tggcactgcc acagttcacc ctatggcgga caccacagtg 5820 
gggacagaac agcagcaaaa gtgctacaat caggtttttt ctggccctct atttttaaag 5880 
atgctcacga gtttgtgcgt tgttgtgata aatgccagag aacagggggg atatctcgaa 5940 
gaaatgagat gcctttgcag aatatcatgg aagtagagat ctttgactgt tggggcatag 6000 
acttcatggg gccttttcct tcgtcatacg ggaatgtcfca catcttggta gctgtggatt 6060 
acgtctccaa atgggtggaa gccatagcca cgccaaagga cgatgccagg gtagtgatca 6120 
aatttctgaa gaagaacatt ttttcccgtt ttggagtccc acgagccttg attagtgata 6180 
ggggaacgca cttctgcaac aatcagttga agaaagtcct ggagcactat aatgtccgac 6240 
ataaggtggc cacaccttat caccctcaga caaatggcca agcagaaatt tctaacaggg 6300 
agctcaagcg aatcctggaa aagacagttg catcaacaag aaaggattgg tccttgaagc 6360 
tcgatgatgc tctctgggcc tataggacag cgttcaagac tcccatcggc ttatcaccat 6420 
ttcagctagt gtatgggaag gcatgtcatt taccagtgga gctggagtac aaagcatatt 6480 
gggctctcaa gttgctcaac tttgacaaca acgcatgcgg ggaaaagagg aagctacagc 6540 
tgctggaatt agaagagatg agactgaatg cctacgagtc atccaaaatt tacaaggaaa 6600 
agatgaaggc atatcatgac aagaagctac tgaggaaaga attccagcca gggcagcagg 6660 
tattactctt taactcaagg ctaaggctat tcccaggtaa gctgaagtcc aagtggtcag 6720 
ggccattcat aatcaaagaa gtcagacctt acggagcagt agaattggtg gaccctagag 6780 
aagaggactt tgagaagaaa tggatcgtca atggacagcg cttgaagcct tataacggag 6840 
gacaactaga gcgattgacg accatcatct acttaaatga cccttgagaa ggcctactgt 6900 
ctagctaaag acaataaact aagcgctggt tgggaggcaa cccaacatat tttgtaaaaa 6960 
tgtagttatc tttattctat gtaaaaaaaa aaaaaaagcc caataggtgc aaataggaaa 7020 
caggaggtgc aaaaagcaaa ggcccaacag gtgaagacaa caataggagg ggtgccaata 7080 
gcaaaactga agtgggctgc acgaagccac gcgcccaatt cttggtcttt tcacacaaaa 7140 
caatcactaa cgaaggtaaa gaattgcttt gtatggatgt tgttatgaat gcacaggtaa 7200 
cagcacgcta agccctgctc gacgcttagc caatgaagac ggattgaagg ccataacgac 7260 
gagctcgtta agcgtgacga agcacgctaa gcaggcgcct gacaggacga gaaagcaaag 7320 
cgcgcgctta gccggcactt ccgcgctaag cgcgctcatg aacatcactg aacgcgctaa 7380 
acgtgtgcca gaggcgctaa acgcgtgcca gaggcgctaa acgcgtgcat tagtcacagc 7440 
aggatggtgc taagcgcggg gttgggcctc agggcccatc aaccctcgca ccttacttgt 7500 
tgcaccccta tttctactat tcccactccc ttctaatttc tttttgcacc ccccttcttt 7560 
actgactgca cctctatttt gattactttt tgcacccccc ctgattgcta acttcagact 7620 
atctttcttg ttttttgttt ttttggtttt ttggtcagat ggcctcccgt aaacgcaaag 7680 
ctgtgcccac acccggggaa gcgtccaact gggactcttc acgtttcact ttcgagattg 7740 
cttggcacag ataccaggat agcattcagc tccggaacat ccttccagag aggaatgtag 7800 
agcttggacc agggatgttt gatgagttcc tgcaggaact ccagaggctc agatgggacc 7860 
aggttctgac ccgacttcca gagaagtgga ttgatgttgc tctggtgaag gagttttact 7920 
ccaacctata tgatccagag gaccacagtc cgaagttttg gagtgttcga ggacaggttg 7980 
tgagatttga tgctgagacg attaafcgatt tcctcgacac cccggtcatc ttggcagagg 8040 
gagaggatta tccagcctac tctcagtacc tcagcactcc tccagaccat gatgccatcc 8100 
tttccgctct gtgtactcca gggggacgat ttgttctgaa tgttgatagt gccccctgga 8160 
agctgctgcg gaaggatctg atgacgctcg cgcagacatg gagtgtgctc tcttatttta 8220 
accttgcact gacttttcac acttctgata ttaatgttga cagggcccga ctcaattatg 8280 
gcttggtgat gaagatggac ctggacgtgg gcagcctcat ttctcttcag atcagtcaga 8340 
tcgccoagtc catcacttcc aggcttgggt tcccagcgtt gatcacaaca ctgtgtgaga 8400 
ttcagggggt tgtctctgat accctgattt ttgagtcact cagtcctgtg atcaaccttg 8460 
cctacattaa gaagaactgc tggaaccctg ccgatccatc tatcacattt caggggaccc 8520 
gccgcacgcg caccagagct tcggcgtcgg catctgaggc tcctcttcca tcccagcatc 8580 
cttctcagca tttttcccag agaccacggc ctccacttct atccacctca gcacctccat 8640 
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acatgcatgg acagatgctc aggtccttgt accagggtca gcagatcatc attcagaacc 8700 

tgtatcgatt gtccctacat ttgcagatgg atctgccact catgactccg gaggcctatc 8760 
gtcagcaggt cgccaagcta ggagaccagc cctccactga caggggggaa gagccttctg 8820 
gagccgctgc tactgaggat cctgccgttg atgaagacct catagctgac ttggctggcg 8880 
ctgattggag cccatgggca gacttgggca gaggcagctg atcttatgct ttaatgtttt 8940 
cttttatatt atgtttgtgt tctcttttat gttttatgtt atgtttttat gtagtctgtt 9000 
tggtaattaa aaagaggtag tagtaaaaat attagtattt cagtatgtgt tttctgagta 9060 
ataagtgcat gataactcaa gcaatcataa ttctttagct tgttcagaaa ggttcaacac 9120 
ttgagatgcc actgatcctt ggagaaacac tggttctgga agcaaaagtc aggtcaagaa 9180 
atggaacatg aatagcacag agtggaaagg ttagcttgat ggaacaaggt cataactggt 9240 
acgccgaata cttgtttaag tccctgtgag catggttgtc aaactctaga gtcaactcat 9300 
agactctcat gagtttaaga gtttacttca gtcccgcgag ttgactcgga agcaaactcg 9360 
cttttgagca aactcgtgga ctcggagtga actcatgtaa actcgtaaga gtctacgagt 9420 
tgactctaga gtttgacaac catgcataag tgttcaaaat taaagcattt aaataattaa 9480 
aaaaagcaca aatgtcttca aagaagcatg ttcaatcctc taataggatc atcttcatga 9540 
atatcatcac tttcatcatc atctccatct ccatcatcat catcaaggtc ttcctcagat 9600 
tgtgcatcat cattaggttc cacaaagatt aaattatcta gatcaaaagc ttaaaataga 9660 
tatcaaatat gctatattag aaatagttaa aacttaaaat aatacacaag caaattttaa 9720 
atatgagaaa gttcagaaat tatacctttt cttggtgtta ttaaagtttc attttatctt 9780 
ctcttttgca ttttccatct cctcacatat gaaaagcata attctattga atttcagtaa 9840 
caagtttgat ccaactccaa cattgtaagg tcagttgttg tgttttgtaa tagactaata 9900 
tgaagtatga agtatgaact atgaacttat tgtcatctgt ttgcaaattg gtgcattttg 9960 
aatatattta cttattatcc attttttttt ttttacgaag tagactctca cgagtctgcg 10020 
tagactctcg atatcgataa ccttgccgat gagagtgtga acttaattgt gagagaaaat 10080 
gcctattttt aagttcctgg ttttgcatca ttcttagacg gttagaatag ttacttaagg 10140 
tggatatgat caaggccatg tttgtttgtt tacctactta gccaaaaagc caacctaaca 10200 
tagttttacc ccttgcaccc atgattgagc caactgatta ttttgaatta accttgagcc 10260 
aattaaacaa aatcctgacc ttttaggatt ttaagagagt aaaaatgggt tataaaggtc 10320 
ttaatttggg ggattttggg aaataggtag ccaagacaat aagtacagca cacaaagtag 10380 
gacacctttt acaaacagta ggcccaattt cgaaaaaaaa atgaaaagaa tttaataaag 10440 
ggcagaaaca aaagagcaag agaggtgtca aaagaaaagt gttgtgggga aataaaaggg 10500 
ctaagtaaaa aggcctaggc agaattggaa atttttgttc tcttttaatc ctaactttga 10560 
atttccaaga aaaaccatga ttttttgtaa gccaggcccc gatacaagcc aataaagtcc 10620 
ttagtgatcc accaaaggta actagagata actgtaactg agatgaaatg caaaattttg 10680 
aagtgttact tgcaggttgt tatcaaattg caaacactaa actaggcact tgtgagcaga 10740 
gggaaacacc agccttgtga ggaaagtaag gcaagccaaa tttgattgag ttccagatga 10800 
ctaactgatt caattcttct gttgtaatgc tttcatttta agatgttgac agatgcagaa 10860 
aggaccagtg aaagaaggag gaactgagcc attgatagtg ttggaatatt taagaacttg 10920 
cttgagaatt tacttgtttt tggttttctt ggggacaagc aaagtttcat ttggggaatt 10980 
ttgataactg ctaaataatt gtgaattaat agtagaaaat tagtcaaatt ttggcttaaa 11040 
attaattatt tagcagttat ttgtgattaa aagttagaaa agcaattaag ttgaattttt 11100 
ggccatagat atgaaaactg aaggtacaac aagcaaaagg cagcagaaag tgaagaaaaa 11160 
gaataaaatc tgaagcagac ccagcccaac acgcgccctt agcgcgcgtc acgcgctaag 11220 
cttgcaaggc agcacaggca ctaagcgagg cgttaagcac gaagatgcag gattcgttac 11280 
gtgcgctaag cgcgaggcac acgctaagcg cgcgatccaa cagaagcaca cgctaagcct 11340 
gcagcatgcg ctaagcgcgc ctacgaaggc ccaaagccca tttctacacc tataaataga 11400 
gatccaagcc aagggagaat gtacaccttg cctcagagca cttctctcag cattccaagc 11460 
ttgagctctc ccttttctct ctatattctt tgcttttatt atccattctt tcttfccaccc 11520 
cagttgtaaa gcccctcaat ggccatgagt ggttaatccc ctagctacgg cctggtaggc 11580 
ctaaaaagcc aatgatgtat ggtgtacttc aagagttatc aatgcaaaga ggattcattc 11640 
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caggttttat gttctaattc tttccttttt atcttgcafct tatgtcttaa atttctgttg 11700 , 
ggttttattc gctcgggaga gggtatttcc taataagggt ttaagaagta atgcatgcat 11760 
cagttttagg ggttatacgc ttggtaaagg gtaacaccta atagaacaaa ttaagaaaag 11820 
gatcgtcggg ctagcattgc taggcataga atgatggccc aatgcccatg catttagcaa 11880 
catctagaat ttaaccttaa tgcattttaa ttattgaatc ttcacaaagg catttgggag 11940 
ataggtagtt aaaataggct tgtcatcgtg aggcatcaag ggcaagtaaa attaatagat 12000 
gtgggtagaa ctaattcaac tgcattggta atgaacatca taaattcatt catcgtaggc 12060 
caattaggtt tgtccggtct tggcattttc atcaattgtc ttcctaaatt atttgatcta 12120 
atagcaacaa tttattctta tgcctattcc tgtttttact atttactttt acttacaaat 12180 
tgaagagtat tcaataaagt gcaataaaat ccctatggaa acgatactcg gacttccgag 12240 
aattactact tagaacgatt tggtacactt gtcaaacacc tcaaca 12286 

<210> 18 
<211> 1802 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plant 
retroelement sequence 

<400> 18 

Met Arg Gly Arg Thr Ala Ser Gly Asp Val Val Pro lie Asn Leu Glu 
15 10 15 

lie Glu Ala Thr Cys Arg Arg Asn Asn Ala Ala Arg Arg Arg Arg Glu 
20 25 30 

Gin Asp lie Glu Gly Ser Ser Tyr Thr Ser Pro Pro Pro Ser Pro Asn 
35 40 45 

Tyr Ala Gin Met Asp Gly Glu Pro Ala Gin Arg Val Thr Leu Glu Asp 
50 55 60 

Phe Ser Asn Thr Thr Thr Pro Gin Phe Phe Thr Ser He Thr Arg Pro 
65 70 75 80 

Glu Val Gin Ala Asp Leu Leu Thr Gin Gly Asn Leu Phe His Gly Leu 
85 90 95 

Pro Asn Glu Asp Pro Tyr Ala His Leu Ala Ser Tyr He Glu He Cys 
100 105 110 

Ser Thr Val Lys He Ala Gly Val Pro Lys Asp Ala He Leu Leu Asn 
115 120 125 

Leu Phe Ser Phe Ser Leu Ala Gly Glu Ala Lys Arg Trp Leu His Ser 
130 135 140 

Phe Lys Gly Asn Ser Leu Arg Thr Trp Glu Glu Val Val Glu Lys Phe 
145 150 155 160 
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Leu Lys Lys Tyr Phe Pro Glu Ser Lys Thr Val Glu Arg Lys Met Glu 
165 170 175 



He Ser Tyr Phe His Gin Phe Leu Asp Glu Ser Leu Ser Glu Ala Leu 
180 185 190 

Asp His Phe His Gly Leu Leu Arg Lys Thr Pro Thr His Arg Tyr Ser 
195 200 205 

Glu Pro Val Gin Leu Asn He Phe He Asp Asp Leu Gin Leu Leu He 
210 215 220 

Glu Thr Ala Thr Arg Gly Lys He Lys Leu Lys Thr Pro Glu Glu Ala 
225 230 235 240 

Met Glu Leu Val Glu Asn Met Ala Ala Ser Asp Gin Ala He Leu His 
245 250 255 

Asp His Thr Tyr Val Pro Thr Lys Arg Ser Leu Leu Glu Leu Ser Thr 
260 265 270 

Gin Asp Ala Thr Leu Val Gin Asn Lys Leu Leu Thr Arg Gin He Glu 
275 280 285 

Ala Leu He Glu Thr Leu Ser Lys Leu Pro Gin Gin Leu Gin Ala He 
290 295 300 

Ser Ser Ser His Ser Ser Val Leu Gin Val Glu Glu Cys Pro Thr Cys 
305 310 315 320 

Arg Gly Thr His Glu Pro Gly Gin Cys Ala Ser Gin Gin Asp Pro Ser 
325 330 335 

Arg Glu Val Asn Tyr He Gly He Leu Asn Arg Tyr Gly Phe Gin Gly 
340 345 350 

Tyr Asn Gin Gly Asn Pro Ser Gly Phe Asn Gin Gly Ala Thr Arg Phe 
355 360 365 

Asn His Glu Pro Pro Gly Phe Asn Gin Gly Arg Asn Phe Met Gin Gly 
370 375 380 

Ser Ser Trp Thr Asn Lys Gly Asn Gin Tyr Lys Glu Gin Arg Asn Gin 
385 390 395 400 

Pro Pro Tyr Gin Pro Pro Tyr Gin His Pro Ser Gin Gly Pro Asn Gin 
405 410 415 



Gin Glu Lys Pro Thr Lys He Glu Glu Leu Leu Leu Gin Phe He Lys 
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420 



425 



430 



Glu Thr Arg Ser His Gin Lys Ser Thr Asp Ala Ala He Arg Asn Leu 
435 t 440 445 

Glu Val Gin Met Gly Gin Leu Ala His Asp Lys Ala Glu Arg Pro Thr 
450 455 460 

Arg Thr Phe Gly Ala Asn Met Glu Arg Arg Thr Pro Arg Lys Asp Lys 
465 470 475 480 

Ala Val Leu Thr Arg Gly Gin Arg Arg Ala Gin Glu Glu Gly Lys Val 
485 490 495 

Glu Gly Glu Asp Trp Pro Glu Glu Gly Arg Thr Glu Lys Thr Glu Glu 
500 505 510 

Glu Glu Lys Val Ala Glu Glu Pro Lys Arg Thr Lys Ser Gin Arg Ala 
515 520 525 

Arg Glu Ala Lys Lys Glu Glu Pro Leu Ala Leu Pro Gin Asp Leu Pro 
530 535 540 

Tyr Pro Met Ala Pro Thr Lys Lys Asn Lys Glu Arg Tyr Phe Ala Arg 
545 550 555 560 

Phe Leu Glu He Phe Lys Gly Leu Glu He Thr Met Pro Phe Gly Glu 
565 570 575 

Ala Leu Gin Gin Met Pro Leu Tyr Ser Lys Phe Met Lys Asp He Leu 
580 585 590 

Thr Lys Lys Gly Lys Tyr He Asp Asn Glu Asn He Val Val Gly Gly 
595 600 605 

Asn Cys Ser Ala He He Gin Arg He Leu Pro Lys Lys Phe Lys Asp 
610 615 620 

Pro Gly Ser Val Thr He Pro Cys Thr He Gly Lys Glu Ala Val Asn 
625 630 635 640 

Lys Ala Leu He Asp Leu Gly Ala Ser He Asn Leu Met Pro Leu Ser 
645 650 655 

Met Cys Lys Arg He Gly Asn Leu Lys He Asp Pro Thr Lys Met Thr 
660 665 670 

Leu Gin Leu Ala Asp Arg Ser He Thr Arg Pro Tyr Gly Val Val Glu 
675 680 685 
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Asp Val Leu Val Lys Val Arg His Phe Thr Phe Pro Val Asp Phe Val 
690 695 700 



lie Met Asp He Glu Glu Asp Thr Glu He Pro Leu He Leu Gly Arg 
705 710 715 720 

Pro Phe Met Leu Thr Ala Asn Cys Val Val Asp Met Gly Lys Gly Asn 
725 730 735 

Leu Glu Leu Thr He Asp Asn Gin Lys He Thr Phe Asp Leu He Lys 
740 745 750 

Ala Met Lys Tyr Pro Gin Glu Gly Trp Lys Cys Phe Arg He Glu Glu 
755 760 765 

He Asp Glu Glu Asp Val Ser Phe Leu Glu Thr Pro Lys Thr Ser Leu 
770 775 780 

Glu Lys Ala Met Val Asn His Leu Asp Cys Leu Thr Ser Glu Glu Glu 
785 790 795 800 

Glu Asp Leu Lys Ala Cys Leu Glu Asn Leu Asp Gin Glu Asp Ser He 
805 810 815 

Pro Glu Gly Glu Ala Asn Phe Glu Glu Leu Glu Lys Glu Val Pro Ser 
820 825 830 

Glu Lys Pro Lys He Glu Leu Lys He Leu Pro Asp His Leu Lys Tyr 
835 840 845 

Val Phe Leu Glu Glu Asp Lys Pro He Val He Ser Asn Ala Leu Thr 
850 855 860 

Thr Glu Glu Glu Asn Arg Leu Val Asp Val Leu Lys Lys His Arg Glu 
865 870 875 880 

Ala He Gly Trp His He Ser Asp Leu Lys Glu He Ser Pro Ala Tyr 
885 890 895 

Cys Met His Arg He Met Met Glu Glu Asp Tyr Lys Pro Val Arg Gin 
900 905 910 

Pro Gin Arg Arg Leu Asn Pro Thr Met Lys Glu Glu Val Arg Lys Glu 
915 920 925 

Val Leu Lys Leu Leu Glu Ala Gly Leu He Tyr Pro He Ser Asp Ser 
930 935 940 



Ala Trp Val Ser Pro Val Gin Val Val Pro Lys Lys Gly Gly Met Thr 
945 950 955 960 
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Val Val Arg Asp Glu Arg Asn Asp Leu lie Pro Thr Arg Thr Val Thr 
965 970 975 



Gly Trp Arg Met Cys lie Asp Tyr Arg Lys Leu Asn Glu Ala Thr Arg 
980 985 990 

Lys Asp His Phe Pro Leu Pro Phe Met Asp Gin Met Leu Glu Arg Leu 
995 1000 1005 

Ala Gly Gin Ala Tyr Tyr Cys Phe Leu Asp Gly Tyr Ser Gly Tyr Asn 
1010 1015 1020 

Gin lie Ala Val Asp Pro Arg Asp Gin Glu Lys Thr Ala Phe Thr Cys 
1025 1030 1035 1040 

Pro Phe Gly Val Phe Ala Tyr Arg Arg Met Pro Phe Gly Leu Cys Asn 
1045 1050 1055 

Ala Pro Ala Thr Phe Gin Arg Cys Met Leu Ala lie Phe Ser Asp Met 
1060 1065 1070 

Val Glu Lys Ser lie Glu Val Phe Met Asp Asp Phe Ser Val Phe Gly 
1075 1080 1085 

Pro Ser Phe Asp Ser Cys Leu Arg Asn Leu Glu Arg Val Leu Gin Arg 
1090 1095 1100 

Cys Glu Glu Thr Asn Leu Val Leu Asn Trp Glu Lys Cys His Phe Met 
1105 1110 1115 1120 

Val Arg Glu Gly lie Val Leu Gly His Lys He Ser Ala Arg Gly He 
1125 1130 1135 

Glu Val Asp Arg Ala Lys He Asp Val He Glu Lys Leu Pro Pro Pro 
1140 1145 1150 

Leu Asn Val Lys Gly Val Arg Ser Phe Leu Gly His Ala Gly Phe Tyr 
1155 1160 1165 

Arg Arg Phe lie Lys Asp Phe Ser Lys He Ala Arg Pro Leu Ser Asn 
1170 1175 1180 

Leu Leu Asn Lys Asp Val Ala Phe Val Phe Asp Glu Glu Cys Leu Ala 
1185 1190 1195 1200 

Ala Phe Gin Ser Leu Lys Asn Lys Leu Val Thr Ala Pro Val Met He 
1205 1210 1215 

Ala Pro Asp Trp Asn Lys Asp Phe Glu Leu Met Cys Asp Ala Ser Asp 
1220 1225 1230 
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Tyr Ala Val Gly Ala Val Leu Gly Gin Arg Lys Asp Lys Val Phe His 
1235 1240 1245 



Ala Tie Tyr Tyr Ala Ser Lys Val Leu Asn Glu Ala Gin Leu Asm Tyr 
1250 1255 1260 

Ala Thr Thr Glu Lys Glu Met Leu Ala lie Val Phe Ala Leu Glu Lys 
1265 1270 1275 1280 

Phe Arg Ser Tyx Leu lie Gly Ser Arg Val lie lie Tyr Thr Asp His 
1285 1290 1295 

Ala Ala lie Lys His Leu Leu Ala Lys Thr Asp Ser Lys Pro Arg Leu 
1300 1305 1310 

lie Arg Trp Val Leu Leu Leu Gin Glu Phe Asp lie lie lie Lys Asp 
1315 1320 1325 

Lys Lys Gly Ser Glu Asn Val Val Ala Asn His Leu Ser Arg Leu Lys 
1330 1335 1340 

Asn Glu Glu Val Thr Lys Glu Glu Pro Glu Val Lys Gly Glu Phe Pro 
1345 1350 1355 1360 

Asp Glu Phe Leu Leu Gin Val Thr Glu Arg Pro Trp Phe Ala Asp Met 
1365 1370 1375 

Ala Asn Tyr Lys Ala Thr Gly Val He Pro Glu Glu Phe Asn Trp Ser 
1380 1385 1390 

Gin Arg Lys Lys Phe Leu His Asp Ala Arg Phe Tyr Val Trp Asp Asp 
1395 1400 1405 

Pro His Leu Phe Lys Ala Gly Ala Asp Asn Leu Leu Arg Arg Cys Val 
1410 1415 1420 

Thr Lys Glu Glu Ala Arg Ser He Leu Trp His Cys His Ser Ser Pro 
1425 1430 1435 1440 

Tyr Gly Gly His His Ser Gly Asp Arg Thr Ala Ala Lys Val Leu Gin 
1445 1450 1455 

Ser Gly Phe Phe Trp Pro Ser He Phe Lys Asp Ala His Glu Phe Val 
1460 1465 1470 

Arg Cys Cys Asp Lys Cys Gin Arg Thr Gly Gly He Ser Arg Arg Asn 
1475 1480 1485 

Glu Met Pro Leu Gin Asn He Met Glu Val Glu He Phe Asp Cys Trp 
1490 1495 1500 
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Gly lie Asp Phe Met Gly Pro Phe Pro Ser Ser Tyr Gly Asn Val Tyr 
1505 1510 1515 1520 

lie Leu Val Ala Val Asp Tyr Val Ser Lys Trp Val Glu Ala lie Ala 
1525 1530 1535 

Thr Pro Lys Asp Asp Ala Arg Val Val lie Lys Phe Leu Lys Lys Asn 
1540 1545 1550 



lie Phe Ser Arg Phe Gly Val Pro 
1555 1560 

Thr His Phe Cys Asn Asn Gin Leu 
1570 1575 

Val Arg His Lys Val Ala Thr Pro 
1585 1590 

Ala Glu lie Ser Asn Arg Glu Leu 
1605 



Arg Ala Leu lie Ser Asp Arg Gly 
1565 

Lys Lys Val Leu Glu His Tyr Asn 
1580 

Tyr His Pro Gin Thr Asn Gly Gin 
1595 1600 

Lys Arg lie Leu Glu Lys Thr Val 
1610 1615 



Ala Ser Thr Arg Lys Asp Trp Ser Leu Lys Leu Asp Asp Ala Leu Trp 
1620 1625 1630 

Ala Tyr Arg Thr Ala Phe Lys Thr Pro lie Gly Leu Ser Pro Phe Gin 
1635 1640 1645 

Leu Val Tyr Gly Lys Ala Cys His Leu Pro Val Glu Leu Glu Tyr Lys 
1650 1655 1660 

Ala Tyr Trp Ala Leu Lys Leu Leu Asn Phe Asp Asn Asn Ala Cys Gly 
1665 1670 1675 1680 

Glu Lys Arg Lys Leu Gin Leu Leu Glu Leu Glu Glu Met Arg Leu Asn 
1685 1690 1695 

Ala Tyr Glu Ser Ser Lys lie Tyr Lys Glu Lys Met Lys Ala Tyr His 
1700 1705 1710 

Asp Lys Lys Leu Leu Arg Lys Glu Phe Gin Pro Gly Gin Gin Val Leu 
1715 1720 1725 

Leu Phe Asn Ser Arg Leu Arg Leu Phe Pro Gly Lys Leu Lys Ser Lys 
1730 1735 1740 

Trp Ser Gly Pro Phe lie lie Lys Glu Val Arg Pro Tyr Gly Ala Val 
1745 1750 1755 1760 

Glu Leu Val Asp Pro Arg Glu Glu Asp Phe Glu Lys Lys Trp lie Val 
1765 1770 1775 
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Asn Gly Gin Arg Leu Lys Pro Tyr Asn Gly Gly Gin Leu Glu Arg Leu 
1780 1785 1790 

Thr Thr lie lie Tyr Leu Asn Asp Pro Glx 
1795 1800 



<210> 19 

<211> 9829 

<212> DNA 

<213> Glycine max 

<400> 19 

tgataactgc taaataattg tgaattaata gtagaaaatt agtcaaattt tggcttaaaa 60 
ttaattattt agcagttatt tgtgattaaa agttagaaaa gcaattaagt tgaatttttg 120 
gccatagata tgaaaactga aggtacaaca agcaaaaggc agcagaaagt gaagaaaaag 180 
aataaaatct gaagcagacc cagcccaaca cgcgccctta gcgcgcgtca cgcgctaagc 240 
ttgcaaggca gcacaggcac taagcgaggc gttaagcacg aagatgcagg attcgttacg 300 
tgcgctaagc gcgaggcaca cgctaagcgc gcgatccaac agaagcacac gctaagcctg 360 
cagcatgcgc taagcgcgcc tacgaaggcc caaagcccat ttctacacct ataaatagag 420 
atccaagcca agggagaatg tacaccttgc ctcagagcac ttctctcagc attccaagct 480 
tgagctctcc cttttctctc tatattcttt gcttttatta tccattcttt ctttcacccc 540 
agttgtaaag cccctcaatg gccatgagtg gttaatcccc tagctacggc ctggtaggcc 600 
taaaaagcca atgatgtatg gtgtacttca agagttatca atgcaaagag gattcattcc 660 
aggttttatg ttctaattct ttccttttta tcttgcattt atgtcttaaa tttctgttgg 720 
gttttattcg ctcgggagag ggtatttcct aataagggtt taagaagtaa tgcatgcatc 780 
agttttaggg gttatacgct tggtaaaggg taacacctaa tagaacaaat taagaaaagg 840 
atcgtcgggc tagcattgct aggcatagaa tgatggccca atgcccatgc atttagcaac 900 
atctagaatt taaccttaat gcattttaat tattgaatct tcacaaaggc atttgggaga 960 
taggtagtta aaataggctt gtcatcgtga ggcatcaagg gcaagtaaaa ttaatagatg 1020 
tgggtagaac taattcaact gcattggtaa tgaacatcat aaattcattc atcgtaggcc 1080 
aattaggttt gtccggtctt ggcattttca tcaattgtct tcctaaatta tttgatctaa 1140 
tagcaacaat ttattcttat gcctattcct gtttttacta tttactttta cttacaaatt 1200 
gaagagtatt caataaagtg caataaaatc cctatggaaa cgatactcgg acttccgaga 1260 
attactactt agaacgattt ggtacacttg tcaaacacct caacaagttt ttggcgccgt 1320 
tgtcggggat tttgttctcg cacttaattg ccatactata ttagtttgta agcttaattc 1380 
ttcttttctt ggctcattct tttattattc tttactttac tttttcttct atcctttctt 1440 
tcttctccca taaattgcac gggtagtgcc tttttgtttt tatacgaggt agaactgcat 1500 
ctggagacgt tgttcctatt aacttagaaa ttgaagctac gtgtcggcgt aacaacgctg 1560 
caagaagaag aagggagcaa gacatagaag gaagtagtta cacctcacct cctccttctc 1620 
caaattatgc tcagatggac ggggaaccgg cacaaagagt cacactagag gacttctcta 1680 
ataccaccac tcctcagttc tttacaagta tcacaaggcc ggaagtccaa gcagatctcc 1740 
tactcaaggg aacctcttcc atggtcttcc aaatgaagat ccatatgcgc atctagcctc 1800 
atacatagag atatgcagca ccgttaaaat cgccggagtt ccaaaagatg cgatactcct 1860 
taacctcttt tccttttccc tagcaggaga ggcaaaaaga tggttgcact cctttaaagg 1920 
caatagctta agaacatggg aagaagtagt ggaaaaattc ttaaagaagt atttcccaga 1980 
gtcaaagacc gtcgaacgaa agatggagat ttcttatttc catcaatttc tggatgaatc 2040 
ccttagcgaa gcactagacc atttccacgg attgctaaga aaaacaccaa cacacagata 2100 
cagcgagcca gtacaactaa acatattcat cgatgacttg caaccttaat cgaaacagct 2160 
actagaggga agatcaagct gaagactccc gaagaagcga tggagctcgt cgagaacatg 2220 
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gcggctagcg atcaagcaat 

ttggagctta gcacgcagga 
gaagccctca tcgaaaccct 
cactcttctg ttttgcaggt 
caatgtgcaa gccaacaaga 
tacggatttc agggctacaa 
tttaatcacg agccaccggg 
acgaataaag gaaatcaata 
cagcacccta gccaaggtcc 
ctgcaattca tcaaggagac 
ctagaagttc aaatgggcca 
ggtgctaaca tggagaagaa 
agaagagcgc aggaggaggg 
gagaagacag aagaagaaga 
agggaagcca agaaggaaga 
cccaccaaga agaacaagga 
gaaatcacta tgccattcgg 
aaagacatcc tcaccaagaa 
aattgcagtg cgataataca 
accatcccgt gcaccattgg 
agtatcaatc tgatgccctt 
accaagatga cgcttcaact 
gatgtcctgg tcaaggtacg 
gaagaagaca ctgagattcc 
gtggtggata tggggaatgg 
gaccttatca aggcaatgaa 
attgatgagg aagatgtcag 
gtaaatgctt tagactgtct 
aacttggatc aagaagacag 
gaagttccgt ctgagaagaa 
gtgttcttgg aggaagataa 
aataggttgg tagacgtcct 
ctcaggaatt agccctgcct 
agtccgacaa ccctagaggc 
gctcaagctt ttggaggctg 
agtacaggtg gttcctaaga 
gataccaaca cgaactgcca 
agccacacag aaggaccatt 
agggcaggca tactactgct 
cccagagatc aggagaagac 
atgtcattcg ggttatgtaa 
tcagacatgg tggagaagag 
tcatttgaca actatttgag 
ttggtactaa attgggaaaa 
aagatctcag ccagagggat 
ccgccaccaa tgaatgttaa 
aggtccatca aggacttctc 
gtggctttta agtttgatga 
caccactgca ccagtaatga 
tgccagtgat tatgcagtag 



ccttcatgat cacacttatg 

cgcaactttg gtacaaaaca 
cagcaagctg cctcaacaat 
agaagaatgc cccacatgca 
cccctctcgt gaagtaaatt 
ccagggaaat ccatctggat 
gtttaatcaa ggaagaaact 
taaggagcaa aggaaccaac 
gaatcagcaa gaaaagccca 
aagatcacat caaaagagca 
actggcgcat gacaaagccg 
ccccaaggaa gaatgaaaag 
taaggttgaa ggagaagact 
gaaggtggca tcaccaccta 
accactagcc cttccacagg 
gcgttacttt agacgtttct 
ggaagcctta cagcagatgc 
ggggaagtat attgacaacg 
aaggaagcta cccaagaagt 
gaaggaagcc gtaaacaagg 
gtcaatgtgc aaaagaattg 
ggcagaccgc tcaatcacaa 
ccacttcact tttccggtgg 
ccttatctta ggcagaccct 
gaacttagag ttgactattg 
gtacccacag gagggttgga 
ttttctcgag acaccataga 
aaccagtgaa gaggaagaag 
tattcctgag ggagaagcca 
gaagatagag ttgaagatat 
gcctatagtg atcagtaatg 
aaagaaacac agggaagcaa 
actgcatgca catgataatg 
ggctgaatcc aacaatgaag 
ggttcatata ccccatctct 
aaggcggaat gacagtggta 
ctggttggtg gatgtgtatc 
tccccttacc tttcatggat 
tttggatgga tattcaggat 
ggcctttaca tgccccttcg 
cgcactagcc atatttcaga 
catcgaggta tttatggacg 
gaacctagag atggtactac 
gtgtcatttc atggttcgag 
tgaggttgat cagacaaaga 
aggtgtcaga agtttcttag 
gaagattgcc aggcccttaa 
agaatgttca gcagcatttt 
ttgcaccaga ctggaataaa 
gagcagtttt gggacagagg 



ttcccacaaa aagaagcctc 2280 

agctgttgac gaggcagata 2340 
tacaagcgat aagttcttcc 2400 
gagggacaca tgagcctgga 2460 
atataggcat actaaatcgt 2520 
tcaatcaagg ggcaacaaga 2580 
tcatgcaagg ctcaagttgg 2640 
caccatacca gccaccatac 2700 
ccaaaataga ggaactgctg 2760 
cggatgcagc cattcggaat 2820 
aacggcccac tagaactttc 2880 
cagtactgac ttgagggcag 2940 
ggccagaaga aggaaggaca 3000 
agaccaagag ccagagagca 3060 
atctcccata tcttatggca 3120 
tggaaatatt caaagggtta 3180 
ccctctactc caaatttatg 3240 
agaatattgt ggtaggaggc 3300 
ttaaagaccc cggaagtgtt 3360 
ccctcattga tctaagagca 3420 
ggaatttgaa gatagatccc 3480 
ggccatatgg ggtggtagaa 3540 
acttttttat catggatatc 3600 
tcatgctgac tgccaactgt 3660 
ataatcagaa gatcaccttt 3720 
agtgcttcag aatagaggag 3780 
cttcgctaga aaaagcaatg 3840 
atctgaaggc ttgcttggaa 3900 
atttcgagac gctagagaag 3960 
tgcctaatca tttgaagtat 4020 
cactcacaac agaggaagaa 4080 
ttggatggca catatcggat 4140 
atggaagagg actacaagcc 4200 
gaagaggtaa gaaaggaggt 4260 
gatagcgctt gggtaagtcc 4320 
cgaaatgaga ggaatgactt 4380 
gactatcgca agttgaatga 4440 
tagatgctgg aaaggcttgc 4500 
acaaccagat cgcggtagac 4560 
gcgtctttgc ttacagaagg 4620 
ggtgcatgct agccattttt 4680 
acttctggat ttttggaccc 4740 
agaggtgcgt atagactaac 480 0 
agggcatagt cctgagccac 4860 
tagacgtcat tgagaagttg 4920 
ggcatgcagg tttctacagg 4980 
gcaatctgtt gaataaggat 5040 
tagacactaa agaataagct 5100 
gattttgaac taatgtgtga 5160 
cacgacaagg tatttcacgc 5220 
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catctattat gctagtaagg tccttaataa agcataacta aattatgcga ccacagaaaa 5280 f 
gcagatgcta gccattgtct tttccttgga gaagttcagg tcgtacttga tagggtcgag 5340 
ggtcaccatt ttcacaaatc atgctgccat caagcacttg ctcgccaaaa cagactcaaa 5400 
gctgaggttg attagatggg tcctgctgat acaagaattt gacatcatca tcaaggacaa 5460 
taaaggatcc aagaatgtgg tagccaatca tttatcctga ttaaagaatg aagaagtcac 5520 
caaggaagaa ccagaggtaa aaggagaatt tcctgatgaa tttcttttgt aggttaccac 5580 
cagaccttgg tttgcagaga tggctaacta caaagccaca ggagtcattc cagaggagtt 5640 
taattggagt cagaggaaga aattcttgca tgatgcacgc ttctatgtgt gggataatcc 5700 
tcatttgttt agggcaggag ctgataatct attaaggaga tgcgtcacaa aggaggaagc 5760 
acagagcatt ctttggcact gccacagttc accctatggc ggacaccaca gtggggacag 5820 
aacagcagca aaagtgctac aatcaggttt tttctggcct tctattttta aagatgctta 5880 
cgagtttgtg cgttgttgtg ataaatgcca gagaacaggg gggatatctc gaaggatgga 5940 
gatgcctttg cagaatatca tggaagtaga gatctttgac tgttggggca tagacttcat 6000 
ggggcctctt ccttcttcat acgagaatgt ttacatcctg gtagctgtgg attacgtctc 6060 
caaatgggtg gaggccatag ccattccaaa agacgatgcc agggtagtga taaaatttct 6120 
gaagaagaac atcttttccc attttggagt cccatgagcc ttgattagtg atggggaacg 6180 
cacttctgca ataatcagtt gaagaaagtc ctggagcact ataatgtaag acataaggtg 6240 
gccacacctt atcaccctca gacaaatggc caagtagaaa tttctaacaa agagctcaag 6300 
cgaatcctgg agaagacagt tgcatcatca agaaagaatt gggccttgaa gctcgatgat 6360 
actctttggg cctacagggc agcattcaaa actcccatcg gcttatcacc gtttcagcta 6420 
gtgtatggga aggcatgtca tttaccagtg gagctggagc acaaagcata ttaggctctc 6480 
gagttactca actttgataa caacgcatgc ggagaaaaga ggaagctaca gttgctggaa 6540 
ttagaagaga tgagactgaa tgcctacgag tcatccaaaa tttacaacca aaagatgaag 6600 
gcatatcatg acaagaagct acagaggaaa gaattccaac catggcagca ggtattactc 6660 
tttaaatcaa ggctaaggct attcccaggt aagctgaagt ccaagtggtt agggccgttc 6720 
ataatcaatg aagtcagacc tcacggagca gtagaattgg gggaccctag agaagagaac 6780 
tttgagaaga aatggatcgt caatggacaa cgcttaaagc tttataacga aggacaacta 6840 
gagcgattga cgaccatcat ctacttgaat gacccttgag gaggcctagt gtctagctaa 6900 
agacaataaa ctaagcgctg gttgggaggc aacccaacat attttgtaaa aatgtagtca 6960 
tttttctgta ttccttcaaa aaaaaaggga aaagcccaat aggtgcaaat agaaaacagc 7020 
aggtgcagaa agtaaagacc cagtaggtga agtcagcaat aggaggggtg ccaatagaag 7080 
aagcgaagtg ggctgcacga agccacgcgc atctaggcgc taagcgccta ggtatatttt 7140 
caatttttaa attttaaaaa ttctgaggga aaccaaggga cgcttccctt ggtatgctta 7200 
gcgaccagat gcgcgctaag cgcgcgaacc ataaattgct ggacagtttt caaaactgtc 7260 
ccacccctca gctgcccttt tgtattttaa atttcaacca cctcattttt ttttctcttc 7320 
tgcgcactcc cactccctat accctttttc tctacatttc ctctaaactt actcgcctcc 7380 
ctgtgcctct tcacgtagtt tttacgaaaa taggtgagat tgggaatctg gactgttgct 7440 
gtaatacttt gcaggtacca tcacgctaag ccctacacaa aggcttagcg agaaaaagaa 7500 
acatagaaag gaagaaagaa gcatgcgcta agcctgcgcc agacaggaca agaaaacaca 7560 
gcatgcgttt agccggcacc tcgtgctaag cgcgctcatg agacfccagtg aacgcgctaa 7620 
gcatggggct gggccttagg gcccatcagc cctcgtgcct tactttctgc acccfcctttt 7680 
tcactaacta cactcccttc tgaatttctt tttgcaccct cctctattac taaccacaat 7740 
ctatttttcc gtctttgttt ctttgttttt tcagatggcc tcccgcaaac gccgagctgt 7800 
gcccacacct ggggaagcat caagctggga ctcttcccgc ttcacctcgg agatcatttg 7860 
gcatagatac caggataaca ttcagctccg gaacattctt ctggagagga atgtcgagct 7920 
cacacccagg atgtttgatg agttcctcca ggagctccag aggtgcagat gggaccaggt 7980 
gttaacccga cttccagaga agaggattga tgtcgctctg gtgaaggagt tttactccaa 8040 
cttatatgat ccagaggacc atagtccaaa gttttgtagg gttcaaggac aggtcatgtg 8100 
gtttgatgca gagacgatta acgacttcct tgacacccca gtcatcctgg cagatgtaga 8160 
ggagtaccca gcctactctc agtacctccg cactcctccc gatcatgatg ccatcctctc 8220 
cactttgtgt actccagggg gacggtttgt tctgaatgtt gatggtgccc cctagaagtt 8280 
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gctgcggaag gatctgacga cactcgctca gacatagagt gtcctttctt attttaacct 8340 
tgttcttact tctcacactt ctgatattaa tgttgacagg gcccgtctca tatatggctt 8400 
ggtgatgaag atggacctgg acgtggacag ttttatttcc cagcaaatca gtcagatcgc 8460 
ccaatccaac acatccaggc tcgggttccc agcgttgatc acggcactgt gtgacattca 8520 
gggggttgtt tctaacaccc tgatttttga gttactcaat cctatgatta accttgcgta 8580 
cattacacta ctaaaaaaaa gctattttac gacgcgcgtt ccacatcgtt tctgccaaaa 8640 
atgtcgtaat aggagtagcg gtggcaattc cgtaaataag tgagcatttt atgtgccatg 8700 
tgcatggcgc gtgacacatt caacgacgtt ggccatgggt gcccgtcttt gtaggtggcg 8760 
cgctggtaac ttaagacggt gcactfcaaaa acatcgtcgt tgaaattttg aatttcgaag 8820 
acgttgctct taagccaccg tcgttaaggt tgatgtatat aatgttgtaa tttgcgctat 8880 
ttcgtgaaca ctcgctcgag ctcccgcttc cctgtgtgtc tgaaatttct gtgtactgtg 8940 
acctcgccat gacttgtggc gtttgcccac acccccgtca cctcgtccgg catctcgtct 9000 
tgtggtggca ccgccgaagc cagtgagtac ccctttttgg aggggtcgta acacggctgt 9060 
gttttgaagg taaggttgtg cgaagatttg atgctccata gttgttactt gctctgagtt 9120 
tttcttttag tgatgtatct tttacccctc tttcagtgct tcttccctca gaatttgatt 9180 
gccggtatta gaaccccact attcatcagg tccaaacaag cttaaatcat ggtaaatgta 9240 
cttcttgaca aatccaacat ttgcaaggtg gtttgacata tgagaaatag ctttaaccta 9300 
atgttcttaa atttattatg aagctctcta gcgattacga aaatctctca atatcttctc 9360 
tctctgtctc acatgcatca ctgtaagata ggtgtcaaaa agaaaggatt gaagttaaat 9420 
ttaaacctaa tgttttgaaa tgaaggaaaa aaagaaagag attaatgacg ctagggaact 9480 
tgaatgaaga aagagaaagg aacataatta gtcctttgaa ctgattgggg tggggagtgt 9540 
ggcacgaaac ataatttcta gttctatgga tttattcgtg acactgtggt aggaccaagc 9600 
aaactctgcc cccagagtgc gcagtgtctt gcagtctgag aggttctttt gttgggctag 9660 
tttgaggaat tcttcattgc agggttgagc acggtggcca atggccaagg agagaaaaga 9720 
cagtactgtc aaaatggtta atggtaagat gagtgaagat gacatgtttt tttgttgtct 9780 
ctttgtgtgt ttccttttgg tgggaaaatg tgatgcatag agagatcga 9829 

<210> 20 
<211> 12571 
<212> DNA 
<213> Glycine max 

<400> 20 

gatcttaaat tcttaaactt tgataacagt gcatacggag agaagagaaa gttgcagfcta 60 
ctggaactcg aagaaatgag gttgaacgct tacgaatcat ctaggattta caagcagaag 120 
gtaaaggcgt atcatgataa gaaattacaa aagaaagaat tccagccagg gcagcaagta 180 
ctactcttca actccaggtt gagattattc acaggaaagc tgaagtcaaa gtggtcagga 240 
tcgttcatta ttaaggaaat cagacctcac ggagcggtag aattggtgga ccctcgagaa 300 
gaaaattatg agaagaaatg gatcgtcaac ggacaacgct taaaaattta caatggagga 360 
caactagaga agttgacgac catcatgcat ttaaaagatt cttgaaagaa gccctatgtc 420 
tagctaaaga cattaaacta agcgctggtt gggaggcaac ccaacatact tatgtaaggt 480 
atttataagt atttatattc tgtctttatt atattttgca gttgttattt caggttaaaa 540 
gaaaaaacag gggccctccg gactcgcacc agagtatcaa cgtccatatc tgaggcaccc 600 
cctacttctc agccttccgc tccatcacct actgatcttc atgctcagat gttgcggtct 660 
attcacacag gacaggagac ccttatggag aacatgcaca agctgtcctt tcatctacat 720 
atggatccac cactgatcac tccataggtc tatcgtcagc gggtcgtctg gccatgagac 780 
cagctctcca ctgacagggg ggaagagccc tctggagatg ctgcagttga tgaagacctc 840 
atagcagact tggctagtgo tgattggggt ccatgggcag atttgggagg cggcacagga 900 
cactggtttt atttttcttg atgtttttgt ttatgtttaa tgtttatgtt ttatgtcttt 960 
atgttttatt tggtttctag ttattatggt cttaattgta gttttatgtt caaaatgaaa 1020 
agcagtggta ataatattag atttgagcat atgcgtgaat aaataaattg catgataact 1080 
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tgagaaatga caattttgag tttgttctaa 
ttggaaagca ctggtcttgg aagcaaaagt 
aaaggaaagg ttagcttgat ggaatgaaga 
ccggtgagag tgtgacctta attgtgagag 
catcattctt ggactgttaa aattagttac 
ttttatttta cccactcagc caaaaagcca 
attgagccaa aaagaattat aatgatttat 
attcctaacc ttgtgtagga ttctaagaga 
tattttgggg gattttgaac aaatgggtaa 
acctctaaat aattgtgagc ccattactat 
tattattatt attattatta ttattattat 
agaaagaata agaagagaaa gggcaaagaa 
agtgctgaag gcaaaaaagg ctaagtggga 
gaatgtatgc tctcttataa ccttatattt 
ttagccaggc cccattacaa ggcatgaaag 
ctaaccttaa ccaagatgaa gtacaaaact 
gcaaacactt gaccaggcac ttgtgagtag 
gcaagccgga cctgttggaa ttccatataa 
tttatttcaa gatcatgaca gatgcaaaga 
atggagagtg ttggaatgat tggaacttgc 
attttattct ttttatttgc ttggggacaa 
tgctaaataa tagtgaatta atagtggaaa 
tttagtagtt atttatgctt taatttggaa 
gatgtgaaaa agggaggtac aacaagcaaa 
aaatcagacg aagacccaag cccaaatttt 
aacacacaca ctttcagaga gctcagtttt 
ccttcccttt ttcttatatt cttattacct 
tcttgactat gagtggctaa acccctagct 
gtatggagca tttcaagagt tatcaataaa 
ccgttctttc ttatttatcc tgtatttcgg 
cgggagaggg taaagcctaa ttaggggtaa 
tagtccattc gggagagggt aaagcttaat 
atcattagag ggttttcctt ccaggttctt 
cagtctttat tttctgttag tctttagtcc 
taaggaatga ttgcgtgaat ctgttttaag 
aatagaacaa taaaagaaaa aaatcacagg 
caaacatata gaatttaatc ttaatgcatc 
ggaagatagg taattaaggt aggcttgtca 
agatgtgggg cagaatcagt tcactggtat 
ctaggctgat tagacttttt aggttttagc 
ttattgtttg aagtttctta ttctattgtt 
ttactgttgg gttttcttag aaatagttat 
attccttatt ttactgttgg gttttattag 
tttaggctta ttagatttag taatatttta 
ttcctttaat ttagaagtag ctgcttagat 
tatctttaaa tcttttatct tttccttatc 
caaatttctt atcccttgct agatttaaat 
tttgcaaact aaaaagtact tcacataagt 
cttaccgaga gattattact acgagcgatt 
attgcctgat catctaaagt atgtgttctt 
cgcactcaca acaaaggagg aaaataggtt 



aaggtccaac actggaaagg ctactagtca 1140 
caaatcaagg aatgaaacat gattcacgga 1200 
cacatctggt acgccaatac tgaattaatc 1260 
aaaacgcctg tttttaagct cttagttttg 1320 
ttaaggtgga tatgatcaag gccatgtttg 1380 
acccaacata attttatccc ttgcacccat 1440 
ttgagtaaac ccctgagcca agaaattgat 1500 
gcagtagggt tccaaatgct tataaggcct 1560 
agtagccaag gtaataacac acattagaac 1620 
tattattatt attattatta ttattattat 1680 
tattggttat aaaaaaaaga agaaaaaaag 1740 
aaaaaatgaa aaagagaggt ttcagtggaa 1800 
aataggtctt ggcaagacct taaatttttg 1860 
tgaatttcca agaaaaacca tgattctttg 1920 
tccttagtga cccaccgaag gtaattaagg 1980 
cttgagtttt atttacaggt tgttaaaatt 2040 
agagaaacac cagttttgta aggaagtaag 2100 
ttgacttgtt tctgctcttg tgtttatgct 2160 
gaccagccaa aggatcaagg aattgaagtc 2220 
ttgagaaaat ttttgcttaa gaatggaata 2280 
gcaaagttta atttggggga ttttgataac 2340 
attggtctga aattaactta gaattaatta 2400 
agatttaatt aattttgaat tctgattgca 2460 
aaggagcaaa aataaagaaa aagaagaaga 2520 
cacctataaa taagaaggtc agcctagcaa 2580 
cagacttctg gcactcagtt ctctccttct 2640 
ttctttcacc cccttctcat tgtaaagccc 2700 
agggcctggc aggcctaaaa agccaatgat 2760 
gagaggattt ccttccaggt tctttattta 2820 
accttatttt ctgttagggt ttagtccact 2880 
ggaatgaata cttgaatcta ttttaagggt 2940 
agaacaataa aaggaagaaa ttatcgggtt 3000 
ttatctgctt ttctttctta ttctgcatct 3060 
actcgggaga gggtaaagcc taattaaggg 3120 
ggttagttca ctcaggagag ggtaacgctt 3180 
gttagcattg acccgatgcc catactttag 3240 
ttagttattg agtctttgca aagggcattt 3300 
tcatgaggca tcaggggcaa gtagatggat 3360 
tgataacaga caaatcttga atccatatat 3420 
aattttatta tatagatttt attccctatt 3480 
gggttttctt agaagtagct attccttatt 3540 
tccttattgt tgggtttctt agaagtagtt 3600 
gagtacttat cccctgttta ggagtaggta 3660 
tagactttat tctttattta ttgcttgagt 3720 
ttaaattact ttatctttat cctttaatct 3780 
ttatctttta tctttcttta tcttttattt 3840 
tgcatttaat tttatacact aaatttacaa 3900 
gcaacaaaat ccctatggta cgatactcga 3960 
tggtacactt gccaaagagc taacaaagat 4020 
ggaggaagat aaacctatag taatcagtaa 4080 
ggttgatgtc ctcaagaaat acagggaagc 4140 
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aattggatgg catatatcgg atctcaagga aattagccct gcttactaca tgcacagaat 4200 

aatgatggaa gagaactaca agccagtccg acaaccccag aggcggctga atccaacaat 4260 
gaaggaagag gtaagaaagg aggtactcaa gctcttggag gctgggctca tatacccctt 4320 
ctctaacagt gcttgggtaa gcccagtaca ggtggttccc aagaaaggtg aaatgacagt 4380 
ggtacgaaat gagaagaatg acttgatacc cagacgaact atcactggtt ggcgaatgtg 4440 
tatcaactat cgcaagctga atgaagccac acgaaaggac catttcccct tacttttcat 4500 
ggatcagatg ctagagagac ttgtagggca ggcatactac tatttcttgg atggatactc 4560 
gggatataat cagatcgcgg tggaccccag agatcaagag aaggcggcct ttacatgccc 4620 
ttttggcgtt tttgcttata gaaggatgcc attcgggtta tgtaatgcac cagccacatt 4680 
tcagaggttc atgctggcca ttttttcaga catggtgtag aaaagcattg aggtatttat 4740 
ggacgacttc tgggtttttg gaccctcatt taacagtttg aggaacctag agatggtact 4800 
ttagagttga gtagagacta acttggtact gaactgggag aagtgtcact tcatggttca 4860 
agagggcatc gtcctaggcc acaagatctc agcaagaggg attgaggtcg atcgggcaaa 4920 
gatagacgtc atcgagaagc tgccaccacc actgaatgtt aaaggggtta gaagtttctt 4980 
agggcatgca ggtttctaca agaggtttat caaggacttc tcaaagattg ccaggcccct 5040 
aagtaacctg ttgaataaag acatggtttt caagtttgat gaagaatgtt caacagcatt 5100 
ccaatcattg aagaataagc ttaccactgc acctgtaatg attgcacccg actggaataa 5160 
agattttgaa ctaatgtgtg atgccaatga ttatgcagta ggagcagttc tgggatagag 5220 
gcacgacaag gtatttcacg ccatctatta tgctagcaag gtcctgaatg aagcatagtt 5280 
gaattatgca accatagaaa aggagatgct agccattgtc tttgccttgg agaaattcaa 5340 
gtcatacttg atagggttga gggtcaccat tttcacagat catgctgcca tcaagcacct 5400 
gcttgccata acagactcaa aaccgaggtt gattagatgg gtcctactgt tacaagaatt 5460 
tgacatcatc atcaaggaca agaaaggatc cgagaatgtg gtagccaatc atctatctcg 5520 
attgaagaat gaagaagtca ccaaggaaga accagaggta aaaggtgaat ttcctgatga 5580 
gtttcttttg caggttaccg ctagatcttg gtttgcagac atggccaatt acaaagccac 5640 
gggagtcatt ccagaggagc fctaattggag tcaaaggaag aaattcttgc acaatgcacg 5700 
cttctatgtg tgggatgatc ctcatctgtt caaggcagga gcagataatt tactaaggag 5760 
atgcgtcaca aaggaggaag cacggagcat tctttggcac tgccacagtt caccctatgg 5820 
cggtcaccac agtggggaca gaacagcagc aaaagtgcta caatcaggtt ttttctggcc 5880 
ctctattttt aaagatgctc acgagtttgt gcgttgttgt gataaatgcc aaagaacagg 5940 
ggggatatct cgaagaaatg agatgccttt gcaaaatatc atggaagtag agatcfcttga 6000 
ctgttggggc atagacttca tcgggcccct gccttcgtta tatggaaatg tctacatctt 6060 
ggtagttgtg gattacgtct ccaaatgggt ggaagtcata gctacgccaa aggatgatgc 6120 
caaggtagta atcaaatttc tgaagaagaa cattttttcc cgttttggag tcccacgagc 6180 
cttgattagt gataggggaa cgcacttctg caacaatcag ttgaagaaag tcttggagca 6240 
ctataatgtc cgacataagg tggccacacc ttatcatcct cagacaaatg gccaagcaga 6300 
aatctctaac agggagctca aggcgaatct tggaaaagac aattgcatca tcaagaaagg 6360 
attgggcctt gaagctcgat gatactctct tggcctatag ggcagcgttc aagactctca 6420 
tcggcttatc gccatttcag ctagtgtatg ggaaggcatg ccatttacca gtggagctag 6480 
agcacaaagc atattgggct ctcaagttgc tcaacttcga caacaacgca tgcggggaaa 6540 
agaggaagct acagatgttg gaattagaag agatgagact gaatgcctac gagtcatcca 6600 
gaatttacaa gcaaaagatg aaggcatatc atgataaaaa gctacagagg aaagaattcc 6660 
atccagggaa gcaggtatta ctctttaact cgaggctaag gctattccca ggtaagctga 6720 
agtccaagtg gtcaaggcca tttatcataa aagaagtcag acctcatgga gcagtagaat 6780 
tggtggaccc ttgagaagag aactttaaga agaaatggat cgtcaatcga cagcgcttga 6840 
agccctacaa cggaggacaa ctcgagcgat tgacgaccat catctactta aatgatcctt 6900 
gagaaggcct actgtctagc taaagacaat aaactaagca ctggttggga ggcaacccaa 6960 
catatttttg taaaaatgta gttattttta ttttatgtaa aaaaaaacaa gagggcccaa 7020 
taggtgcaaa tagcaaacag gaggtgcaaa aagcaaaggc ccaacaggtg aagacaacaa 7080 
taggaagggt gccaatagca aaactgaagt gggctgcatg aagccgcgcg ctaagcgccc 7140 
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aggtatgttt ttaaaatctg atgggcaacc aagggacgct ttccttggtg cgcttagcgg 7200 
ccacatgcgc gctaagcgcg taagtcataa attactggac agttttcgaa actgcccaac 7260 
ccctcagctg cctcctccgc gttattaaat tacaaccatt tcatttcatt atccttcttt 7320 
tctttcgcaa atctaccctt ctttgcacct ctgctactgt aacccctgaa ttcttggtct 7380 
tttcacacaa aacaatcact aacgaaggta aagaattgct ttgtatggat gttgttatga 7440 
atgcacaggt aacagcacgc taagccctgc tcgacgctta gccaatgaag acggattgaa 7500 
ggccataacg acgagctcgt taagcgtgac gaagcacgct aagcaggcgc ctgacaggac 7560 
gagaaagcaa agcgcgcgct tagccggcac ttccgcgcta agcgcgctca tgaacatcac 7620 
tgaacgcgct aaacgtgtgc cagaggcgct aaacgcgtgc cagaggcgct aaacgcgtgc 7680 
attagtcaca gcaggatggt gctaagcgcg gggttgggcc tcagggccca tcaaccctcg 7740 
caccttactt gttgcacccc tatttctact attcccactc ccttctaatt tctttttgca 7800 
ccccccttct ttactgactg cacctctatt ttgattactt tttgcacccc ccctgattgc 7860 
taacttcaga ctatctttct tgttttttgt ttttttggtt ttttggtcag atggcctcct 7920 
gtaaacaccg agctgtgccc acacccgggg aagcgtccaa ctgggactct tcacgtttca 7980 
ctttcgagat tgcttggcac agataccagg atagcattca gctccggaac atccttccag 8040 
agaggaatgt agagcttgga ccagggatgt ttgatgagtt cctgcaggaa ctccagaggc 8100 
tcagatggga ccaggttctg acccgacttc cagagaagtg gattgatgtt gctctggtga 8160 
aggagtttta ctccaaccta tatgatccag aggaccacag tccgaagttt tggagtgttc 8220 
gaggacaggt tgtgagattt gatgctgaga cgatfcaatga tttcctcgac accccggtca 8280 
tcttggcaga gggagaggat tatccagcct actctcagta cctcagcact cctccagacc 8340 
atgatgccat cctttccgct ctgtgtactc cagggggacg atttgttctg aatgttgata 8400 
gtgccccctg gaagctgcfcg cggaaggatc tgatgacgct cgcgcagaca tggagtgtgc 8460 
tctcttattt taaccttgca ctgacttttc acacttctga tattaatgtt gacagggccc 8520 
gactcaatta tggcttggtg atgaagafcgg acctggacgt gggcagcctc atttctcttt 8580 
agatcagtca gatcgcccag tccatcactt ccaggcttgg gttcccagcg ttgatcacaa 8640 
cactgtgtga gattcagggg gttgtctctg ataccctgat ttttgagtca ctcagtcctg 8700 
tgatcaacct tgcctacatt aagaagaact gctggaaccc tgccgatcca tctatcacat 8760 
ttcaggggac ccgccgcacg cgcaccagag cttcggcgtc ggcatctgag gctcctcttc 8820 
catcccagca tccttctcag cctttttccc agtgaccacg gcctccactt ctatccacct 8880 
cagcacctcc atacatgcat ggacagatgc tcaggtcctt gtaccagggt cagcagatca 8940 
tcattcagaa cctgtatcga ttgtccctac atttgcagat ggatctgcca ctcatgactc 9000 
cggaggccta tcgtcagcag gtcgcctagc taggagacca gccctccact gacagggggg 9060 
aagagccttc tggagccgct gctactgagg atcctgccgt tgatgaagac ctcatagctg 9120 
acttggctgg cgctgattgg agcccatggg cagacttggg cagaggcagc tgatcttatg 9180 
ctttaatgtt ttcttttata ttatgtttgt gttctctttt atgttttatg ttatgttttt 9240 
atgtagtctg tttggtaatt aaaaagaggt agtagtaaaa atattagtat ttcagtatgt 9300 
gttttctgag taataagtgc atgataactc aagcaatcat aattctttag cttgttcaga 9360 
aaggttcaac acttgagatg ccactgatcc ttggagaaac actggttctg gaagcaaaag 9420 
tcaggtcaag aaatggaaca tgaatagcac agagtggaaa ggttagcttg atggaacaag 9480 
gtcataactg gtacgccgaa tacttgttta agtccctgtg agcatggttg tcaaactcta 9540 
gagtcaactc atagactctc atgagtttaa gagtttactt cagtcccgcg agttgactcg 9600 
gaagcaaact cgcttttgag caaactcgtg gactcggagt gaactcatgt aaactcgtaa 9660 
gagtctacga gttgactcta gagtttgaca accatgcata agtgttcaaa attaaagcat 9720 
ttaaataatt aaaaaaagca caaatgtctt caaagaagca tgttcaatcc tctaatagga 9780 
tcatcttcat gaatatcatc actttcatca tcatctccat ctccatcatc atcatcaagg 9840 
tcttcctcag attgtgcatc atcattaggt tccacaaaga ttaaattatc tagatcaaaa 9900 
gcttaaaata gatatcaaat atgctatatt agaaatagtt aaaacttaaa ataatacaca 9960 
agcaaatttt aaatatgaga aagttcagaa attatacctt ttcttggtgt tattaaagtt 10020 
tcattttatc ttctcttttg cattttccat ctcctcacat atgaaaagca taattctatt 10080 
gaatttcagt aacaagtttg atccaactcc aacattgtaa ggtcagttgt tgtgttttgt 10140 
aatagactaa tatgaagtat gaagtatgaa ctatgaactt attgtcatct gtttgcaaat 10200 
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tggtgcattt tgaatatatt tacttattat ccattttttt ttttttacga agtagactct 10260 
cacgagtctg cgtagactct cgatatcgat aaccttgccg atgagagtgt gaacttaatt 10320 
gtgagagaaa atgcctattt ttaagttcct ggttttgcat cattcttaga cggttagaat 10380 
agttacttaa ggtggatatg atcaaggcca tgtttgtttg tttacctact tagccaaaaa 10440 
gccaacctaa catagtttta ccccttgcac ccatgattga gccaactgat tattttgaat 10500 
taaccttgag ccaattaaac aaaatcctga ccttttagga ttttaagaga gtaaaaatgg 10560 
gttataaagg tcttaatttg ggggattttg ggaaataggt agccaagaca ataagtacag 10620 
cacacaaagt aggacacctt ttacaaacag taggcccaat ttcgaaaaaa aaatgaaaag 10680 
aatttaataa agggcagaaa caaaagagca agagaggtgt caaaagaaaa gtgttgtggg 10740 
gaaataaaag ggctaagtaa aaaggcctag gcagaattgg aaatttttgt tctcttttaa 10800 
tcctaacttt gaatttccaa gaaaaaccat gattttttgt aagccaggcc ccgatacaag 10860 
ccaataaagt ccttagtgat ccaccaaagg taactagaga taactgtaac tgagatgaaa 10920 
tgcaaaattt tgaagtgtta cttgcaggtt gttatcaaat tgcaaacact aaactaggca 10980 
cttgtgagca gagggaaaca ccagccttgt gaggaaagta aggcaagcca aatttgattg 11040 
agttccagat gactaactga ttcaattctt ctgttgtaat gctttcattt taagatgttg 11100 
acagatgcag aaaggaccag tgaaagaagg aggaactgag ccattgatag tgttggaata 11160 
tttaagaact tgcttgagaa tttacttgtt tttggttttc ttggggacaa gcaaagtttc 11220 
atttggggaa ttttgataac tgctaaataa ttgtgaatta atagtaaaga attattcaaa 11280 
ttttggcctg aaattaatta tttagcagtt atttgtgatt aaaagttaga aaattaatta 11340 
aattgaattt ttggttgcag ataagaaaat tggagttaca ttaagcaaaa aaggcaacaa 11400 
aaaatgaagg aaaagaagaa gtctgaagca ggcccagccc aacacgcacg ctaagcgcgt 11460 
gtcacgcgct aagcgtgcaa ggcagtacag gcgctaagcg aggcgttaag ctcgaagatg 11520 
cagaatccgt tacgcgcgct aagcaagggc cacgcgctaa gcgtgcgatc caacagaaac 11580 
acacgctaag cctgcatctc gcgctaagcg cgcgatctga acgcgctaag cgcgaggtgt 11640 
cgcgctaagc gcgcttacga aggcccaaaa cccactttag cagctataaa tagagagtca 11700 
gtccaaggga aacaacacat ctcgcctcag agcacttccc tcagcattct aagcctaagc 11760 
tctccctttt ctctttgttt ttattatcct cattctttct ttcaccccca gttgtaaagc 11820 
cctcaatggc catgagtggc taatctagta gctagggcct ggcaggccta aaaagccaac 11880 
gatatatggt gtacttcaag agttatcaat gcaaagaaga ttcattccag gtttttttgt 11940 
tctaattatt ttctttttat cttgcattca tttcttgaat ttcttttggg ttttatttgc 12000 
tcgggagagg gtatttccta ataagggttt aaggattaat gcatgcatca gttttagggg 12060 
ttatacgctt gggaaagggt aacacctaat agaacatctt aagaaaagaa tcatcgggtt 12120 
agcattgcta ggcatagaat gataactcaa tgcccacgca tttagcaaca tctagaattt 12180 
taccttaatg cattttaatt attgagtctt cgcaaaggca tttgggagat aggtagttaa 12240 
aataggcttg tcatcgtgag gcatcagggg caagtaaaat taatagatgt gggtagaact 12300 
gttacaaatg cattggtaat gaatatcata tttacatgca tcgtaggcca attgggtttg 12360 
tccggtcttg gcatttatat taattgtctt tctaaaacta tttgatctag taatagcaat 12420 
ctattcttgc acttactcct gtttttacta ttttactctt acaaattgaa aagtattcga 12480 
taaagtgcaa taaaatccct gtggaaacga tactcggact tccgaggttt actacttaga 12540 
gcgatttggt acacttgcca aagtctcaac a 12571 

<210> 21 
<211> 4609 
<212> DMA 
<213> Glycine max 

<400> 21 

gatctcccat atcctatggt 
ttggaaatat tcaaaggact 
cccctctact ccaaatttat 
gagaatattg tggtaggggg 



acccaccaag aagaacaagg aacattactt ctgacgtttc 60 
ggaaatcacc atgccattcg gggaagcctt acagcagatg 120 
gaaggacatc ctcaccaaga aggggaagta tattgacaat 180 
caactgtagt gcaataatac agaggaagct acccaagaag 240 
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tttaaggacc ccggaagtgt taccatcccg tgcaccatag gaaaggaaga ggtaaacaag 300 
gccctcattg atctaggagc aagtatcaat ctaatgccct tgtcaatgtg cagaagaatc 360 
aggaatttga agatagatcc caccaagatg acacttcaac tggcagaccg ctcgatcaca 420 
agaccataca gggtggtaga agatgtcctg gtcaaggtac accacttcac ttttccggtg 480 
gactttgtta tcatggatat cgaagaagac acagagattc cccttatctt aggcagaccc 540 
ttcatgctga ttgccaactg tgtggtggat atggggaatg ggaacttgga ggtgagtatt 600 
gacaatcaga agatcacctt tgaccttttc aaggcaataa agtacccata ggagggttgg 660 
aagtgcttta gaatggagga gattgataag gaagatgtca gtattctcga gacaccacag 720 
tcttcgctgg ggaaagcaat ggtaaatgct ttagactgtc taaccagtga agaggaagaa 780 
gatctaaagg cttgcttgga agacttggat tgacaagaca gtattcctaa gggagaagcc 840 
agatttgaga ctctagaaaa ggaagttccg tccgagaaga agaagataga gttgaagata 90O 
ttgcccgatc atctgaagta tgtgttcttg gaggaagata aacctgtagt gatcagtaac 960 
gfcactcacaa cagaggagga aaacaggtta gtagatgtcc tcaagaaaca cagggaatca 1020 
attggatggc acacatcgga tctcaaggga attagccctg cttactgcat gcacaggata 1080 
atgatggaag aggactacaa gccagtctga caaccccaga ggcggctgaa tccaacaatg 1140 
aaggaagagg taagaaaaga ggtactcaag ctcttggagg ttgggctcat ataccccatc 1200 
tctgacaacg cttgggtaag cccagtacag gtggttccca agaaaggtgg aatgacagtg 1260 
gtacaaaatg agaggaatga cttgatacca acacgaacag tcactggctg gcgaatgtgt 1320 
attgactatc acaagctgaa tgaagctaca cggaaggacc atttcccctt acctttcatg 1380 
gatcagatgc tggagagact tgcagggcag gcatactact gtttcttgga tggatactcg 1440 
ggatacaacc agatcgcggt agaccccata gatcaggaga agacggtctt tacatgcccc 1500 
tttggcgtct ttgcttacag aaggatgtca ttcgggttat gtaatgtacc agccacattt 1560 
cagaggtgca tgctgaccat tttttcagac atggtggaga aaagcatcga ggtatttatg 1620 
gacgacttct cggtttttgg accctcattt gacagctgtt tgaggaacct agaaatggta 1680 
cttcagaggt gcgtagagac taacttggta ctgaattggg aaaagtgtca ttttatggtt 1740 
cgagagggca tagtcctagg ccacaagatc tcagctagag ggattgaggt tgatcgggcg 1800 
aagatagacg tcatcgagaa gctgccacca ccactgaatg ttaaaggggt tagaagtttc 1860 
ttagggcatg caggtttcta taggaggttt atcaaggatt tctcgaagat tgccaggccc 1920 
ttaagcaatc tgctgaataa agacatgatt tttaagtttg atgaagaatg ttcagcagca 1980 
tttcagacac tgaaaaataa gctcaccact gcaccggtaa tgattgcacc cgactggaat 2040 
aaagattttg aactaatgtg tgatgctagt gattatgcag taggagcagt tttgggacag 2100 
aggcacgaca aggtatttca caccatctat tatgctagca aggtcctgaa tgaagcacag 2160 
ttgaattatg caaccacaga aaaggagatg ctagccattg tctttgcctt ggagaagttt 2220 
aggtcatact agatagggtc gagggtcacc attttcacag atcatgctgc catcaagcac 2280 
ctgctcgcca aaacagactc aaagctgagg ttgattagat gggtcatgct attacaagag 2340 
tttgacatca ttattaagga caagaaagga tccgagaatg tggtagctga tcatctatct 2400 
cgattaaaga atgaagaagt caccaaggaa gaaccagagg taaaaggtga atttcctgat 2460 
gagtttcttt tgcaggttac cgctagacct tggtttgcag acatggctaa ctacaaagcc 2520 
atgggaatca tcccagagga gtttaattgg agtcagagga agaaattttt gcacgatgca 2580 
cgcttatatg tgtgggatga tcctcatttg ttcaaggcgg gagcaaataa tttattaagg 2640 
agatgcgtca caaaggagga agcacgaagc attctttggc actgccacag ttcaccctat 2700 
ggcatacatc acagcgagga tagaacaaca gcaaaagtgc tacaatcaag ttttttctag 2760 
ccctttattt ttaaagatgc tcacgagttt gtgcattgtt gtgataaatg tcagagaaca 2820 
agggggatat ctcgaagaaa tgagatgcct ttgcagaata tcatggaggt agagatcttt 2880 
gatagttggg gcatagactt catggggcct cttccttcat catacaggaa tgtctacatc 2940 
ttggtagctg tggattacgt ctccaaatgg gtggaagcca tagccacgct gaaggacgat 3000 
gccagggtag tgatcaaatt tctgaagaag aacatttttt cccatttcgg agtcccacga 3060 
gccttgatta gtgatggggg aacgcacttc tgcaacaatc agttgaagaa agtcctggag 3120 
cactataatg tccgacacaa ggtggccaca ccttatcaca ctcagacgaa tggccaagca 3180 
gaaatttcta acagggagct caagcgaatc ctggaaaaga cagttgcatc atcaagaaag 3240 
gattgggcct tgaagctcga tgatactctc tgggcctata ggacagcgtt caagactccc 33 00 



32 



atcggcttat caccatttca gctagtatat gggaaggcat gtcatttacc agtagagctg 3360 

gagcacaagg catattgggc tctcaagttg ctcaactttg acaacaacgc atgcggggaa 3420 
aagaggaagc tacaactgct ggaattagaa gagatgagac tgaatgccta cgagtcatcc 3480 
aaaatttaca agcaaaagac aaaggcatat catgacaaga agctacaaag gaaagaattc 3540 
cagccagggc agcaggtatt actcgttaac tcaaggctaa ggctattccc aagtaagctg 3600 
aagtccaatt ggtcagggcc attcataatc aaagaagtca gacctcacag agcagtagaa 3660 
ttggtggacc ctagagaaga gaactttgat aagaaatgga tcatcaatgg acagcgcttg 3720 
aagccttata acggaggaca actagagcga ttgacgacca tcatctactt aaatgaccct 3780 
tgagaaggcc tactgtcgag ctaaagacaa taaactaagc gctggttggg aggcaaccca 3840 
acatattttg taaaaatgta gttatcttca ttctatgtaa aaaaaaagcc caacaggtgc 3900 
aaataggaaa cacgaggtgc aaaaagcaaa ggcccaacat gtgaagacaa caataggagg 3960 
ggtgccaata gcaaaactga agtgggctac acgaagctac gtgcttagct cgcgtccgcg 4020 
cgctaagcgc ccagattgca caaaaatagg tgagacttgg aatctggact attgctgtaa 4080 
tatcttgcag gtaccattac gctaagccct acacagaggc ttagcgagaa caggcagcat 4140 
ggaaaaaggg aaggaggagc gcgctaagcc acaacaagta atagaagaaa acgaagcacg 4200 
cgcttagcgg gcactgccgc gctaagcgca ctcttcaaca tcagtgaacg cgctaagcgc 4260 
gtgccagaag cgctaagcgc gtgtcaccgt caccagcagg aaggcgctaa gcgcgaggtt 4320 
gggccttagg gcccatcagc cttcgcgcct tactttttgc acaccccttc tttactaact 4380 
gcacccctat tttgatttct ttttgcaccc cctctgttta ctaactgcag tttgtttctg 4440 
ctgtttcttg tttttgtttc agatggcctc ctgcaaacgc cgagccgtgc ccacacccag 4500 
ggaagcgtct aattgggact cttcccgttt cacttcagag attgcatggc acagatatca 4560 
ggacaacatt cagctctgga acatcctttc ggagaggaat gtcgagctc 4609 

<210> 22 
<211> 9139 
<212> DNA 
<213> Glycine max 

<400> 22 

acctggttgt ttgtatgctt gtcttaatgc ggataggttg tcaagtagct ttagtgctaa 60 
cactgagaag aatccgaagg aagaatgtaa agttttaatg acaaagagca gaatggaaat 120 
tcaagttgat gaagttagag ctgaagagaa ggtggaggga tataaacaac agtcgatagc 180 
tgagcctgca ctggaactag tttccgatct tattgaactt gaggaagttt tggaagagga 240 
agatgaccaa caggagagag agacaccaat aaaagatagt caagaaggaa taaagatgaa 300 
ggaagagcat gaaaaagaaa aacaaaaaga aaaagaagaa atagaaaaag aaaataataa 360 
aaaaaatgaa aaataaaaaa agatggttga tgaggagaaa aaaaagagca agagtgaggt 420 
ttcaagagaa aaaaagagag agattacttc agctgaaggc aaggaagtac catatctatt 480 
ggtaccttcc aagaaggata aagagcaaca cttagccaga tttcttgaca tcttcaagaa 540 
actggaaatt actttgcctt ttggagaagc tctccaacag atgccactct atgccaaatt 600 
tttaaaagac atgctgacaa agaagaacta gtatatccac agtgacacaa tagttgtgga 660 
aggaaattgt agtgctgtca ttcaacacat ccttccccca aatcataagg atcccggaag 720 
tgtcactata ttatgttcca ttagcgaggt tgttgtgggt aaagctctca tagacttggg 780 
agctagtatc aatttaatgc ctctctcaat gtgtcgacga cttggagaga tagagataat 840 
gcccacacgc atgacccttc agttggttga tcactccatc acaagaccat atggagtgat 900 
tgaggatatg ttgattcagg tcaagcaact tgtattccct gtagatttcg tggttatgga 960 
tatagaggag gatcctgaca ttcccataat cttgggacgt cctttcatgt ccgcgaccaa 1020 
ctatatagta gatataggga aaggcaagtt agaattgggt gtggaggatc agaaagtctc 1080 
attcgactta tttgaagcaa ataagcatcc aaatgataag aaagcttgct ttgatctaga 1140 
caaggtagaa caataaatag aattagctac tatagccatg gtactgaact ctcctttgga 1200 
aaaagcattg attaatcatg tagaatgtct tactaaagag gaggaacatg aagtgcaaac 1260 
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ttgtattaaa gagttggatg gtgcaggaga aaattctgag ggacaggatg catttcaaga 1320 
attgaagaat ggtgggcaaa tagaaaaacc aaaagtagaa ttgaagacct tgcctgcaca 1380 
tttgaagtat gtatttctcg aagacaatga ctccaaacca gtgattatta gcagctcgtt 1440 
gaagaaaata gaagatcaac tggtgaagat tttgaagaga cacaaagctg caattggatg 1500 
gcacatatct gacttgcaag gaattagtcc atcttattgc atgcacaaaa tcaatatgga 1560 
agctgattac aaaccagtga gagagcctca aagaagactg aacccaatca tgaaagaaga 1620 
gatgcataag gaggtgctta aattgtagga agcaggcctt atttacccct cctcggatag 1680 
tgcatgggtt agccttgtgc aggttgtccc caagaaagga ggtatgacag tcattaaaaa 1740 
tgataaagat gagttaatat ccataaggac tgtcaccggg tggagaatgt gcattgacta 1800 
tcggaagctg aatgatgcca ctcggaagga ccattatcca cttcctttca tggaccaaat 1860 
gcttgaaaga cttgtagggt aatcctatta ttgttttctc gatgagtact ctggctataa 1920 
ttagattgtt gttgatccta aagatcaaga gaagactgct ttcacctacc cttttggtgt 1980 
attcgcatat cggcacatgc cttttggtct gtgcaatgcc ccagctacat ttcagaggtg 2040 
tattatggca attttttctg atatggtgga aaaatgcatc gaagttttca tggatgattt 2100 
ctctattttt gggccatcct ttaaggggtg cctattaaat cttgaaagag tattacagag 2160 
atgtgaagag tccaatctag ttctcaattg ggagaaattc catttcatgg ttcaagaagg 2220 
aatagtgctg gggcataaaa tttcagtaag gggaatagag gtggacaagg caaagattga 2280 
tgtaattgag aaacttcctc ctccaatgaa tgccaaagaa gtgagaagtt tcttatgaca 2340 
tgcaggattc tacagatgat tcataaaaga tttctcaaaa gtcgcccagc cacttagcaa 2400 
tctgttgaat aaagatgttg cttttgtgtt caatcaagag tgcatggaag catttaatga 2460 
tctgaaaacc agattagtgt ctgctccagt aagtatagca ccagattggg gacaagaatt 2520 
tgagttgatg tgtgatgcaa gtgactatgt cgtaggtgta gtgcttcgac aacggaaggg 2580 
aaaacttttt catgctatat actacgccaa caaggttcta aatgatgcac aggtgaacta 2640 
tgctaccata gaaaaagaaa tgctggcaat tgtctatgca cttgaaaagt ttagatctta 2700 
tttggtaggt tcaagagtta tcatctacat cgatcacgca gctattaaat atttgctcaa 2760 
caaggctgat tccaaaccta gattgataag atggatcttg ttgttgcaag aatttgattt 2820 
ggtgattcgg gataaaaagg gatcggaaaa tgttgtagct gaccatttgt ctagattggt 2880 
gaatgaggaa gtcacattga aagaagcaga agtgagagat gaattccctg atgaatcatt 2940 
attcttagtg agtgagagac cttggtttgc cgatatggcc aacttcaaag ctacaagaat 3000 
catcccaaag gacttaactt ggtagcagag gaagaaattc ctacatgatg ctcgattcta 3060 
tatctgggtt gatcctcatt tgttcaagat aggagctgac aatctcctat gaagatgtgt 3120 
gacacaagaa gaggccaaga acatattatg aaattgccac aattctccat gtggcagcca 3180 
ttatggtgga gataagacga tgaccaaggt tttgcaatct ggattctttt ggcccatgct 3240 
tttcaaagat gctcatcagc atgtgcaaca ctgtgatcaa tgtaagagga tgaggggtat 33 00 
atcaagaaga aatgaaatgc ctctacagaa tattatggag gttgaggtat tcaattgcta 33 60 
ggggattgat tttgtaggtc ccttcccttc gtcttttggc aatgaatata tactagtggc 3420 
gattgactat gtctctaaat tggttgaagc agtggctacc ccgcataatg atgctaagac 3480 
tgtggtaaag tttctaaaga aaaacatttt ctcaagattt ggggtgccta gaattctgat 3540 
taacgatgga ggcacacact tctgcaataa tcatctatag aaggtgttga agcaatataa 3 600 
tgtgacacaa agtagcatca ccttatcacc cccagaccaa tgggcaagca gaagtatcaa 3660 
acagggaatt gaaaaagatt ttggagaaga ctatagcttc tactagaaaa gactagtcta 3720 
tcaaattaga tgatgcttta tgggcataca gaacaacatt caagactccg ataggattat 3780 
ctccatttca gatggtgtac ggcaaggctt gtcacttacc agtggagatg gaatataaag 3840 
catactaggc cttgaagttt ttgaactttg atgaagccgc atccagagaa caaaggaggc 3900 
tgcaactttt ggagttggga gatatgagat taactactta tgaatcttca aggctataca 3960 
aagaaagggt caaaaagtat catgacaaga agctgctcaa gaaggacttt cagccaggac 4020 
gacaagagtt gcttttcaac tcaagactta aattgttccc tggaaagctt acatcgaaat 4080 
ggtctggacc atttaccatc aagaaagtcc gcccatatag agcagtggag ctttgtgatc 4140 
ctcaatctaa agatcctgac aggacatggg tagtgaacgg acaaaggttg aatcaatatc 4200 
atggttcatg caatcctacc cctcaagggt attggataga agactccaag aggattgggc 4260 
tagagctgct aaagaaggcc ttggggttct catgaacccc agggtaaatt tctgagccca 4320 
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tggaccaagg ttgggtcctc tcttctttgt aaatattaga ataggttttt ccttcttctc 4380 

aggctaagca ccaatatgct tctgtttttc agtcctttga ataaggctaa gcgcagctgc 4440 
tgcactaagc ccttgttgtg tgtcaaggag gttgagctaa gcgtgcccta ctgcgctaag 4500 
ctcaactatc tcactatttt tgtgttttta tggtcaggct aagcgcgccc tatgtgctaa 4560 
gcctaagggt cattctggtg agcgtgagct aagcgcgcca tgctgcacta agcttagacc 4620 
cttttttgtt ttgaaaattt tagacttagg ctaagcccaa catgctacgc taagcctatc 4680 
tacagaaaaa tattttgtgt ctttaggcta agctcgagtc tactgcgctt agctcatgag 47 40 
taatatttta taaggcgcgc taagcccagc ctgctgcgct aagtgcccag ttcagttttc 4800 
agctttaatt ttttgttttt gatagaaata atcttattta accttgtggt ttgattttat 4860 
tctttcagat agcatcaaag aagagaaagg cacctgccac accttcccag gtctgatatg 4920 
gccgatcgag gttcacttct cttgtggcct aggaaaggta cactgatatt gtggtaccca 4980 
ggaagatact ccctgagtgg aatgtggtaa tctaccacac tgagtttgat gagtttaagg 5040 
aagaactaga gagaagaaaa tgggatgagg aattgaccag ttttgatgaa ggcaacattg 5100 
atgttgccat tctgaaagag ttttatgata acctctatga ttccgacgat aaatcaccta 5160 
agcaggtgag ggtgagaggc catttggtga agtttgatgc agacactctg aacactttct 5220 
tgaagacccc tgtgataatt gaagaggggg aaaagctgcc tgcctactct agatttgcac 5280 
tcttgagtcc tgatcctcaa gagttggctg ctaagctctg catcccaggg agggaatttg 53 40 
agcttaatgt tgacgacttg ccactaaaga tcctcaggaa gaaaatgacc acactcgctc 5400 
agactaggag tgttctttct tactccaact tggtccctac ctcccacact tctcacatca 5460 
cactggatcg ggccaagttg atttatggca ttatcatgaa gatggacatg aatttgggct 5520 
acctcatctc ccaccagatt tctatcattg cccagcatga ctcctctagg cttggattta 5580 
caaccttaat catagctttg tgtaaagcta aaggagtcac attagattcc aaatctttgg 5640 
agagtcttag ccctgccatt aacatggcat atataaagaa gaactgttgg aatctagatg 5700 
atccaacagt gacattcaga gagccaagga aggccagggg taaaagaatc gaggctcccc 5760 
ctacttcagc agcaccaggt gcttctgctc cttcttcatc ttctttacca gatccttcag 5820 
caccatccac ttcgactcca catcttccat ggttactagc ttcagctccc actcccttac 5880 
cagcttcaat tcagctcctt ctacaggacc ctccfccattc acctctaaga cattatttgc 5940 
tatgctgcaa agcctgcaca aaggccagat catcatcata cagaggttgt agagctctgg 6000 
ccagaaacca accatgagta tagaggagtt ccttgcacaa gtggcttgcc caggagtcga 6060 
gccttctcct tctggagggg gtgaggcctt tgcagcccaa gagccttgcc agcagagaag 6120 
cctgtgccag aagcagagga tgagcttgtt cttcctgagc catttgttta tgagattgat 6180 
ccagtcgctc aggaggaagc agcagctcag gagcttcctg cacctatttc tgaggatacc 6240 
ctgccatctg caccagcatt ggagtaagag cagcctagtt cacaggatcc accagctgct 6300 
ccaatgctgg atctgaacga gcatgcagaa gatcagcagt aggatgatca tgagttttaa 6360 
attctacata gtttttaaaa ttttgcaaat tatgaatagt ttcttttatc aattatttag 6420 
ttcatgtcaa ttatttgttt atgctttatt agtctttaaa ttttagtctt ttaaattttt 6480 
gttgtttgag tgttgatagc ttgtacaaaa gcatgtttga acagtgaact tattgattat 6540 
gatattcagt ggtgtgattt cttatgaatg aagtgtttgt gaatgacttg aatgagaaaa 6600 
tgtatgaatt gagtggactg gaatgattag atgtttgttt tgatcaagct tgtagtcatt 6660 
agaagaaaaa gaacatgtga ttagaagtat gactgaaaat gttagtcagt ttgtcaaatt 6720 
gattgtgaag gaatgcattg accgtatccc agtgagagtg tgatccttaa attttgagag 6780 
aaatgacttt aatttagcac taatttttgc acgaatcttt gaagtatgga ttgaatgcat 6840 
gaattgagga taatgaaggc catgttttga ttgtgatagc tatttagcca aaaagctgac 6900 
cttgtgcttg aatgatttat cccttgcacc cagtttgagc tgaatgaatt attgattgat 6960 
tgaaccttga gcctatatag tgttttctcc tgcttccttg tcttaggtta taggagagca 7020 
taatccacag aaaagcttgg ttcaaggcaa atttgttcca aatttggggg agacactggg 7080 
taaagaaata aaatggtcaa aacagagcaa catatacaca ttgttttctg tatgtaaaaa 7140 
aaactgtaag tataaataaa aatgtataaa agtgtgtgtg ctgcaaatca aatcaatgaa 7200 
agctaagtgc ttaataaaag gcaagtatgg ggtaggaatg aataaaaaaa aaagtaaagg 7260 
tttatctatg gatgaatgct ctcgtagaat ctaagctttt gaatcctaga aaaaccatga 7320 
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tttgttggca gcctaacctc attacaagcc tagaaagtcc tttggattca ttttgtgtgt 7380 
ttatttctgt atggtatgag atgaaatgca aaagttagga cttgtgttag ttgttcatga 7440 
tggaatgagc ctaaacactt aagcttgagt gaaacaatga ctgtgaggct ttggttgatg 7500 
attttttcct tgatatctgt cattctcact agcttatttt agttgtgact ctaatgcata 7560 
tgttcctatc tttgaaaaac tgcatgtttg tgaaaagaaa ttggttgaag cattccatga 7 620 
tattcatttc atatgattga atttctctgt gaggagaaca ccatttggat tgaccactgt 7680 
attttgtcac ttgaggacaa gtgaactgtt ctttctttgc ttgaggacaa gcaaaacttt 7740 
aaatttgggg gagtatgtta gtcatcttat acgactaact tttgtataga aaaaattttc 7800 
caaaacttgt atagtttctc caatttatag ttattttgta gggatttgta aataaatctt 7860 
gttttattgt tatagttgtc tctagaatat tttccatttg atttaatgat gaaatctgtt 7920 
caatttcagg ttaaaagagg ctaagtcttg aagtgctaaa agtgggattt acgctcagct 7980 
caccatttgg cctcaacgcg catccaccgc taagcacagc ttcagcgcac ttagtgtgac 8040 
agaagaatct ggcagagcat aaatatcaag gccgcttgct aagcaagatg gttgtcttta 8100 
gccagactca gcgcatgact ggcgctaagc tcaaatccac taactcgcgc taagcacagg 8160 
ggtggcacta agtgcaacgt cgcggattta aagcctattt aaagcctgtc ttgtgcagaa 8220 
ttaggtaata tacacacata gaattttagc aagcaataca aaattccaaa gcaaggacac 8280 
cacagtgcta atttcgatat agaagctctg gaggcagcaa gaggagaagc tttgcagaga 8340 
agcctaggat tcttcaatta gagagagatt agtgagctgt agagtgattg tgaggtgttg 8400 
agaagaggag gagggatccc ccttcttgtg taaggaacaa ttatttggta ctctcaaact 8460 
catttgtgtt agggtttttc tgtaatggct agctaaacac ccttgttggg gatttctaag 8520 
gaacaactga tgtaattact ttaatatcta attaattatg ttttatgtgt tcaatgcttc 8580 
tttcaatgct taattactgc atgctcttgg tctgatcacc catttgtgtg tattgttagg 8640 
tgactttagc attgggaaat gtaccgttgc cttagaactt gatagaagca ggactaaata 8700 
actacattac cagggatgga ttatggggtt ttggttttct aaatatgttg tgatgataat 8760 
gctatttaag ttaagcctag tcatacaaga gggatctgcg gacgaagctt aggttaaatt 8820 
agtataaact tacaagggat cgagatttag tactttaggc tacaacatag aacacaagaa 8880 
catgattaat tagagaaata tcctcatatg catcaacttg tttgttagaa agacccaacg 8940 
ctttttacct attgttgtca acttttactt acttgcattt tttttttacc atagaagtag 9000 
tttatttctg ttttaaccat caattatcaa tgttgttcca acaatgcctt acttctgaat 9060 
aaaactctgt ctaataagca agttccctaa attcgatact tggatcactc tgttttaatt 9120 
ttaaatactt gacaactca 9139 



<210> 23 

<211> 10482 

<212> DNA 

<213> Glycine max 



<400> 23 

tgttagtcgt cttatatgac taacttttgt atagaaaaac ctttttcaaa acatgtatag 60 
tttccccaat ttataattct tttgtaggaa tttgtaaata aatcttgata tgttttgata 120 
cctgccatta gagtatcttt agttggagtt aatgagaaaa tttgtacaat ttcaggtcaa 180 
aagaggctaa aatcttgaag tgctaaaagg agcagtcgtg ctaaatagag cctgtgggct 240 
cagtgcacat ccaccgctaa gtgcagcttc agcatgctta gcgtgacaag ggaacctgaa 300 
agagcacaag aatcaaggtc gcgcgctaag cgagacgttt gtcttttgcc aggctcagcg 360 
cacgactggc gccaagccca aatccactta ctcgcgctaa gcgcgatgtc gcgatttcag 420 
agcctattta agcctgaatt gtcagaatta gggtatgatt ttaagagacc agagctgtat 480 
atttttgcac aaacttcgag aatagtgctc tggaggcagc agagaggcag cagctaagca 540 
gggaagctag ggttcatcac tttgagagat tagagagtgt tttagtgatt gtgaggtgcc 600 
aagaagacga ggagggatcc cccttcctgt gtaagcaaca attgctctgt actttctgtc 660 
tcatttgtat tagggttcct tgtatggctt ggtaaaaacc ctagttgggg atttctaatg 720 
aacagttgat gtaattactt ttcatatcta attaattgtg ttttgtgtgt tcagtgcttc 780 
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tttcaatact taattactgc atgctcttgg cctgatcacc ctcttgtgtg tactattagg 840 

tgactttagc attgggaaat gtagtgctgc catagaacat gatagaagca aggctaaata 900 
actgcattac ctaggatgga ttgtggggtt ttagttttct tattatgctg tgatgataat 960 
gttgtttaag ttaagcctag tccaacaaga gggatctgag gatgaagctt gggttaaatt 1020 
agtctaaact tatgagggat cgaggtttag tactttaggc ttcagcatag aacacaagaa 1080 
catgattaat tagagaaata tcttcatatg cattaactcg tttgttagaa agacccaaca 1140 
ctttatacct attgctgtca actttttaat tacttgcatt tactgctttt taacatagca 1200 
tctagtttac ttttgtttat attctcaatt atcaatgttt gttcacacaa tgccatattt 1260 
ctaaataaaa ctttgtctaa taaacaagtt ccctgagttt gatactcgga ttattccgtt 1320 
ttaattttaa atgcttgata acctggtgcg ttttccgata tttcatttcc cttgaatata 1380 
ctgcttgtaa atttgataga aaggaactgt gttgaagggt aaacaaaaat ttgacacaaa 1440 
gcatttatgg cgccgttgtc ggggaactgg attcattaga agagttcagt tcagttttaa 1500 
ggcattgctt tattttgttt tctttaattc attgattctt tttgctaaca ttttagttac 1560 
tgcacatttt attgttcttt ggaattggat aatttttgtt ttgtttcttt tgtatgcaaa 1620 
ggagatctgt tgtaggtgat ttaattccca tagatttgga gattaatgct acttgcagga 1680 
gacaaaatgc agagagaatt agaaattttt tgcaggactt agaagtagca gcaactctag 1740 
gagagtgacc ctagaagatt actcaagtta aggccacagt ccaagcagct attagatgct 1800 
tctgctgggg gaaaaataaa gttaaagacc cccgaagaag ccatggaact cattgaaaat 1860 
atgactgcaa gtgacattac tattttgaga gatagagccc acattccaac aaaaagaagc 1920 
ctactagagc tttcatcaca agatgcattg ttggcacaaa acaagttgat gtccaagcaa 1980 
ttggaagcat tgaccaaaac actaagtaag tttccagctc aattacattc tgcacaatct 2040 
ttaccatcta ctattttgca ggtcacagtg tgtgccatct gtggtggagc tcacgattct 2100 
ggttgttgta tccccaatga agaaccaaca actcatgaag tcaattacat gggtaaccaa 2160 
cctagaaata attttaatgc aggtggattt cccgaattcc agcatggaca gtaatacaac 2220 
caacaacagg gacaatggag gaccaccctg ggaattaatt caatagagac cagggtggac 2280 
cgtccacaag gccgtaacaa caagggccta gtctctatga gcgtacaacg aagttggaag 2340 
agactctagc tcaatttatg caggtttcta tgtctaacca aaagagcacg gagtttgcca 2400 
taaagaattt ggaagtccaa gtgggacagc ttgcaaaaca gttggtggat aggccgtcaa 2460 
agagctttag tgctaacact gagaaaaatt cgaaggggga atgtaaagct gtcatgacaa 2520 
gaagcagaat ggcaacccat gttgatgaag gaaaagctta gaagaaggtg gaggagcata 2580 
aacaacagtt ggcagctgag ccggcacttg aacccatttc tgattttgtt gaacttgagg 2640 
aagttatgga agatgaagat gaccaaaagg aaaagagaaa gaagaagtag aaaaagaaaa 2700 
atattagaaa aatgaaaaag aaaatgagaa ggttgaggaa agaaagagga gcaagagtga 27 60 
ggtttcaaga gagaaaaaga gagagattac ttcagctgaa ggcaaggatg taccatatcc 2820 
attggtacct tccaagaagg ataaagagcg acacttagcc agatttcttg acatcttcaa 2880 
gaagtcggag atcacattgc cttttggaga aactctccaa cagatgccac tctatgccaa 2940 
atttttaaaa gacatgctga caaagaaaaa ctggtatatc cacagtgaca cgatagctgt 3000 
ggaaggaaat tgtagtgctg tcactcaacg catccttcca ccaaagcata aggatccagg 3060 
aagtgtcaca ataccatgtt ctattggtga agttgcagta ggcaaggctc tcattgactt 3120 
gggagccagt atcaatttaa tgactctctc catgtgccag caacttggag agttagagat 3180 
aatgcccact cgcatgaccc tacagttggc agatcgctcc attgctagac catatggagt 3240 
gatcgaggat gtgttgattc aggtcaagca gcttgtattc cctgcaattt tgtggttatg 33 00 
gatatagagg aggatcctaa cattcccata atcttgggac gtcctttcat gtccacgacc 3360 
agctgtgtag tagatatggg gaaaggcaaa ttagaactgg ttgtggagga tcagaaagtc 3420 
tcattcgact tatttgaagc aatgaagcat ccaaatgatc aaaaagcttg ctttgatctg 3480 
gataaggtag aataggagat agaattagct gctatagcca tggtactgca ctctcatttg 3540 
gaaaaagcac gattaatcat gtagaatgtt tgaccaagga ggaggaacat gaagtgtaga 3600 
cttgtattaa agagttggat ggtgcaggag aaaattccga gggacatact gcatttgaag 3660 
aattgaagaa cagtgggaaa atagaaaaac caaaagtaga attgaagact ttgcctgcac 3720 
attcgaagta tgtatcttgg aagacaatga ctccaaacca gtgattatta gcagctcttt 3780 
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gaagaaaaca gaagaagatc agttggtgca 
atggcacata tctgacttga aaggaattag 
ggaagctgat tacaaaccaa tgagacagcc 
ggaggtgcgc aaggaggtgc ttaagttgct 
gcgtgggtta gcccggtgca ggttgttctc 
gataaagatg aattaatatc cacaaggact 
cggaagttga ataatgccac ttggaaagac 
cttgagagac tcgcaaggca atcatattat 
tagattgcta tagatatcaa agatcaagat 
cgcgtgactt gcgcgtgcat gttccaagaa 
tatttgagga aaacgtcgga aaaaccggaa 
gttcgggagt tgtatttacg cacggggaag 
gatgacaacc tctaatcaaa tgtgcaaata 
tttcacgttc ttatgttttt tttatgcctt 
gggcgtttcc ctttgctcct acgtattcct 
gttcttttgt gaacaaagcg ttttggttaa 
ttttattgaa tgaaaggtca tttaaggtgt 
gaaaagtgag aaaacattaa ggcattggac 
taacaaagtt acatattgat tttaggcttt 
gaaaagacca tttcaaggcg ttggaccttt 
gtttggttta tgaattgatt ttagccttag 
aagaaagaga aatcccaaag aaaaacgtcc 
atatttttga ttattatatt attattttac 
gaccgaacag tcggatttca ttttaacaga 
gtggaaattt attttatttt ttgattaggc 
acgtcaaaag ggggtacgga aagtaaatga 
ggaccactaa gggtacatag aatgaattgt 
gaagaacgac gaagaacgaa cgaagaacgt 
acccacggaa acgttacgga agcacctcgg 
tcactaattt taagtgaatc tcagatacca 
ttcccttatt tataggaaaa ggaaggagat 
ggttgcttcc tccagaagca aatcctggaa 
ccccaatttt actaaatata ccccctgcct 
tggaaactta cgaatttcgt aacgatactt 
cggattacgt aatcatccct tttttgcctt 
cactaacact tccttttaat tttcggcatg 
ttttcttttg gcttccgaca tgtctcggaa 
atacctcgaa gtggtcaaac gacggtcgca 
tagggtatga cacaagagaa gacaactttc 
tgcatgcctt tcggtctatg caatgcccta 
ttttctgata tggtggaaaa atgcattgaa 
ccatctttga tggttgctta tcaaatctgg 
acctggtact taattgggaa aatgtcattt 
taaaatatca gtaaggggaa ttgaggtgga 
tcctcctcca atgaatgtca aacgaatgag 
gtgacttata aaagattttt caaaagtcgc 
tgttgctttt gtgttcaatg gaaagtgtat 
agtgtctgct ccagtaatta ctacaccaga 
cgcgagcgat tatgctatag gtgcagtgct 
tatctactac gccagcaaag ttttaaatga 
agaaatgttg gcaattgttt atgcacttga 



gattttgaag aaacataaag ctacaattgg 3840 
tccatcttat tgcatgcaca aaattattat 3900 
tcaaagaaga ctgaacccaa tcatgaaaga 3960 
agaagcaggc ctcaccccat ctcagatagt 4020 
aagaagggag gtatgacagt cattaaaaat 4080 
gtcaccgggt ggagaatgtg cattgattat 4140 
cattatccac tccctttcat ggaccatatg 4200 
tgttttctgg atggatattc tagttacaat 4260 
gtcgcaacct acccttcagt gggagggcga 4320 
aggaatacgc gcggagtcgc caccaacgtt 4380 
aagacgtgat ctacgaactt taagtgaaag 4440 
gtattagcac cccacacgtc cgtcacaaga 4500 
tgacttcaat ttatgttatc ttcccccttt 4560 
tttatgtttt tatctttttg tggttgacaa 4620 
caattgtgat gagaaaatca aacctacgta 4680 
gttatttttt atcctttttt gcaagatatg 4740 
tggaccatta gacaatcttt cgattctttt 4800 
cattaatgat ttctttattt ttgaaagagt 4860 
ttagaaatct acacttaacc aataaaagcg 4920 
gaaaaatggc gtttttaggc gatgacaaaa 4980 
tttcactttg gttattagtc gattcgattt 5040 
gattgatttt ttgatttatt ttactaaaag 5100 
ctatttttgg ttttcaacgg gttacggcat 5160 
aattaacgga tgttacaatt taaatgatcg 5220 
gagaaaatga cttaagtaaa tgactaaagc 5280 
aatgaaaata aaagcatgtg aaacaaatga 5340 
ttgatttcgg gaacttaccg gttgaagatc 5400 
cgatgaacgg ttgaaaatct tcgcaaaatc 5460 
cttggatttt cttcacggaa acaatttttc 5520 
ggagggtcga acatttttgt tcttccctcc 5580 
gcttgccacc cagctcgccc aggcgagcta 5640 
ggcccaagtg ggcctggttg ctatttgaac 5700 
ttttttggtg attctttttc cgtaaagtta 5760 
gttttctttc cgtaatgttg tggaacctta 5820 
ccggaacgtt acagaacttt acggattgca 5880 
tcacgaactt cacggattgt gctaccacgc 5940 
cttcacaaat tgcctaacca tgggtgccaa 6000 
tcccaacaac ggatggttct cggacgaaat 6060 
actttccctt tcggtgtatt tgcatatcga 6120 
gctacatttc agaggtgtat gatggcaatt 6180 
gttttcatgg acgatttctc tgtttttgga 6240 
aaagagtatt ttagagatgt gaagagtcca 6300 
catggttcaa gaaggaatag tgctggggca 6360 
taaggtgaag attgatgtca ttgagaaact 6420 
aagtttctta ggacatgatg gattctatag 6480 
caaaccactt agcaatttgt tgaacaaaga 6540 
tgaagcattt aatgatttga aaaccagact 6600 
ttgggggtaa gaatttgagt tgatgtgtga 6660 
tggacaaagg aagggcaaaa tttttcatgc 6720 
tgcacaggtt aactatgcta ccacagaaaa 6780 
aaagttcaaa tcttatttgg taggctcaaa 6840 
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agtcatcatc tacattgatc atgcaactat taaatatttt ctcaacaagg ccaattccaa 6900 
aaccctgctt aataagatgg attttgctgc tgcaagaatt tgatttggta attcgggata 6960 
aaaagggatc ggaaaatgtt gtagctaacc aatttgtcta gattggggaa taaagaagtc 7020 
atgtcgaaag aagctgaaat tagagatgaa ttccctaatg agtcattatt cttggtgaat 7080 
gagagacctt gatttgctga tatggccaac ttcaaagccg caggaatcat tccaaaagac 7140 
ctaacttggc agtagaggaa gcaattcctg catgatgctc gattttatat ctgggatgac 7200 
ccgcacttgt tcaagattgg agttgacaat cttctccgaa gatgtgtgac acaagaagaa 7260 
gccaagaaca tattatggca ctgtcacaat tctccatgtg gcggccatta tggtggagat 7320 
aagacgacga ccaaggtttt gcaatctgga ttcttttggc ccacactttt caaggatgct 7380 
catcagaata tgctgcattg tgatcaatgt caaaggatgg ggggcatatc aaaaagaaat 7440 
gaaatgcctt tacagaatat tatggaggtt gaggtatttg actgttgggg gattgatttt 7500 
gtaggtccct tccctttgtc tfcttggcaat gaatacatac tagtggttgt tgactatgtc 7560 
tctaaatggg ttgaagcagt ggctaccctg cataatgatg ctaagattgt ggtaaagttt 7620 
ctaaagacga acattttctc cagatttggg gtgcccagag ttttgattag tgatggaagc 7680 
acacatttct gcaataataa gatacagaag gtgttgaagc aatataatgt aacacacaag 7740 
gtagcatcag cttatcaccc ccaaaccaat gggcaagcag aagtgtcgaa caaggaattg 7800 
aaaaagattt tagagaagac tatggcttct actagaaagg actggtccat taaactagat 7860 
gatgctttat gggcgtatag gactgcattc aagactccga taggtttatc tccatttcag 7920 
atggtgtatg gcaagtcttg tcacttacca gtggagatga aatataaaac atattgggcc 7980 
ttgaagttgt tgaactttga tgaagccgaa tccagagaac aaaggaggct acaacttttg 8040 
gagttggaag agataaaatt aactgcttat gaatcttcac agttgtacaa agaaaaaatt 8100 
aaaaagtatc atgataaaaa actgctcaag agggattttc aacaaggaca acaagtgttg 8160 
cttttcacct caagacttaa attgtttcct gggaagctta aatcgaaatg gtctagacca 8220 
tttaccatca agaaagtccg aacatatgga gcagtggagc tttgtgatcc tcatatgggt 8280 
ggtgaacgga caaaggctaa agcaatatca tggtggagct attgagagat tgaacactat 8340 
tctacacttc aatccaggat aacaggacga tgcgtcaagc taatgacgtt aaccgagcgc 8400 
ttacggggag gcaacccagg tctcttttta tttctatttt tcttgcattt aatttagtta 8460 
gtttaattgc ttgtgattgt aaatgatttc taagcttggt tagtattgag aaaagggttt 8520 
caaagtttta gtaaagagat ggatagaaaa gacttagaga aaaaattttc agttgtccat 8580 
ccgctaagcg cagcccttgt gctaagtgcc atgtcttaat gcactaagca tgtgcttgct 8640 
tgcgctaagc actttgacct ttcaccagtt ggctagatgg ttcagctaag cgcacatcac 8700 
tgcgctaaac ctaagttctt ctctggattt gaacttcatg acttgggctt agaggagttg 8760 
atgcgctaag cgcaactcct tctctgttga aaaattattg taatagcatt aagcttaatt 8820 
tcctctctgg aattgaactt tcaggaattg ggcttagcag caggatacgc taagcgccaa 8880 
tccttcacta ttttgaaata cttggaattg cgctaagcct ggaaccatca ctgtaagtag 8940 
agcttgtttt agtgctaagc ctaacatctt aggctaagtg aaaattgcag gaccaatcag 9000 
agttgcagac agtgctaagc gcgtgtcctc gcactaagct tgaatacctc tctggaattt 9060 
gaaattattg aattaggctt aacgcgagag gtggcgctaa gcgcatgggc cttaaactca 9120 
aatgtcatgt tggcatgcta agcgcaacta tgcgctaagt gcgccaaaca aaaatgctaa 9180 
aataaaatag aactaccaat ggcagttacc atttacactt caaagctttt actcccttat 9240 
gcttgtgccc acattcgtgc ttttgtgcat tttgctgcct ttgcttcaag ttattcctgc 9300 
tttcttgctc tcatcttgca tttccatcac aatccaagta agttttcatg tttattttca 9360 
ttttctttta taagcttaaa ccttagggta gatgatttag tgctttttag tttgcaattt 9420 
tttttaggtt tagtgttttt aggttagttg ttagttaagg taggtttagg gtttacaatg 9480 
taggttttag gttaggtttt tgagcccctt aggggcaatg cctgaaaaag gggtgaaaac 9540 
ccgtgagtaa tttctagaaa tagcgatgaa cgtgctaagc gcacctgctg tgcttagcca 9600 
gttcatcgca acttccttct aatgagtttc aatgatgagc tcgataagcg cgtttgtgcg 9660 
ctaagtgaga caagtgtttt agacacttag tatttttttc aatttttgtt cagcactaaa 9720 
gcctggcttc tcaggctaaa gcacaattct gtctttattt ttcaattgtt ggaataaggc 9780 
taagtgcagc ttgttgtgct aagcccatgt tatgtcttag tgaggttgag ctaagcgtgc 9840 
cctactgcgc taagctcaat tcctccactg ttttcaaaag tgtggattta ggataagccc 9900 
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agcttgttgc gctaagccta gtctatggaa 
gtggctatcg ggcttagccc atgagtaaat 
gtgctaagca cccagtccta ctttcagttt 
tttaactctg ttgtttgatc taattctttt 
gcctcaacat cccaggcccg ctatgataga 
cgttattcta gtgttgtcat tggcaggaaa 
tacacagagt ttgatgaatt cactgaagag 
acaaatttta tggatggcaa cattgatgtt 
tatgacccag aggataaatc acctaagcag 
gatgcagatg ctctgaacac tttttttatg 



aaacattttc tgagtactca cgctaagcgt 9960 
tttcataaag cgcgctaagc ccagccttct 10020 
tatttttttg tttttgttga ataatcctgt 10080 
cagatggcat ctaggaagag aaaggcccat 10140 
tccagattca catctcagga ggcctgggat 10200 
atattacctg aaagaaatgt catgctctat 10260 
ttagagagaa gaaacaggca caaggagtta 10320 
gccattatga aggagttcta tgctaacctc 10380 
gtgaggttca gaggtcattt agtgaaattt 10440 
acccctgtga tc 10482 



<210> 24 
<211> 1857 
<212> DNA 

<213> Arabidopsis thaliana 



<400> 24 

atgagcaatt acagtggcag ttcttctgtt 
tcgtcatctt caaggccaga gagagaacag 
gagatagccc gaggaaagag agcgatgaga 
gaggacgagt acatgcctga acagactcgc 
atattgcctg ctgaggaata tgttaggctt 
tatccttgct cgacctcact tgcacaactc 
caaagttgtc atctggacac tttgatggct 
atacaattcc tctccacact acaagtagag 
gattgtgaag gattgggatt cttgcgattt 
atcaagcgat tggaaggatt gtttgatttt 
gaaagagaag agttgaaaga cttgtggatc 
tccaggtcaa agagcaatca gatacgcagc 
gccaacgtac tctactcccg agagattaca 
atcgcaatgg ccctcaaagg aactctccgc 
gaagtcaatg acacacctct ctctatactt 
tgggcggtca gcaataaccg caagagagca 
acacctattc tgatagcttg tggagtccca 
atggatatcg agcacctacg tcactgccaa 
cacaggttca ggtttgagca ctctacagac 
gaggtcacac ggataatcga gggagataac 
tactatgaga acgctccacc attagatgag 
gggatggatg aagatggagc agtaaagttc 
gtacctccag cgaggcagag caagagcttg 
cagaagtggt gcaagaagca ggacaggctg 
aagctgagtt gctcttcctc caccactgct 
ccatcgagga gaattaatgc acctgcgcac 
catgtccagg ctaggcattc gtcattcgaa 
acactcactc gatctagcag cagatcacgc 
ggtgctggcc gcagcagaag gagagatgtc 
agagctgatg aggtcgagta cccatctgct 
atggcctggg agcaatcgca ggcagccatt 



gatcctgact acaacatgga tgagacagaa 60 
agagaatacg aaagtttcag aaggaaagct 120 
gagaggtatg agcttataga cgaagatctg 180 
agagctacca aacttctgca caagcccgac 240 
ttcaagctga atgagttctg tagcacgagg 300 
ggattgttgg aagatgttca gcacctgtac 360 
tatccgtatg tagcatatga agatgagaca 420 
ctctaccaag gtatgacctc tgatgagttg 480 
tctgtgtatg gtcatgagta caggttatca 540 
cccagtggaa cgggatctaa gccaaagtat 600 
accatcggca gctctgtacc gttgaatgct 660 
cctgtcatca ggtacttcca gcgttctgta 720 
gggactgtca ctaactctga tatggagatg 780 
caaactaaaa atggcatgtc cctccagggt 840 
cttctgatcc atctgtgtgg atacaaaaac 900 
cgaggcgctc tgtgcatagg tggcgtggtg 960 
ctcatttctg ctggactcga gccacgagca 1020 
ttcctggagt ttgcaatggt tgacgatttc 1080 
aggagagcta acatccttct ccctagccct 1140 
attgatttta ggcctgagat tggacgcctc 1200 
gacgatcttc ttgaagaagc tgcttcggat 1260 
gacactagca tgtatcactt tgctgaacat 1320 
actgaagctc ataagaatta cagtaaattg 1380 
atcgccaagt gtttcaagct tctgacagac 1440 
attccacagg tacaacctcc tatggaaatg 1500 
aggcctgagc ttagcgagca gagagtccca 1560 
tcccgggaac acaagagaag aaggaaggct 1620 
ctcattcact cgaggagatc actcgaccgt 1680 
gagtttcctc agagcggtgc tggccgccac 1740 
ggagctgata cagaacaagg aggttcgtct 1800 
gacgagcaac tacgttcatt cttcgac 1857 



<210> 25 
<211> 1254 



40 



<212> DNA 

<213> Pisum sativum 



<400> 25 

atggaatcca ggtccggagc ttcgaaaaag 
cccatacaat tcgacaccga caaatttgtc 
ttggaaaagc gaaagatttt gccggaaaag 
cgtacattcg ccgggctgat taacagcaaa 
cattacgaca tcgcaacagt gcgtgagttc 
ccattcacat ggacgtctag agtgtccggc 
aaccgtgtcc tgggtgaacc gctccatctg 
gatttaaggc ttcaccggga taccgattcg 
tcagttgagc tgaacccatc tggggttccg 
ttggctcaac tgatoctttt gttggttctt 
accgtgccga tcccagtggc acacttggta 
gtggcaagga ttattgcttt ggagttgaag 
gaacgagtga attgtcccct tgctttccct 
agggtgaggc taccctccaa gggtcaagta 
gtggccaagt actgcaaacc gaagaatgta 
gcttctgatg gtcctggtac ttttactcta 
aactacaact gggattggat ggcggcaact 
atgcagctgt tacagttgca gatgcgcgac 
gagcagtttc tgcagcacgc tagctggcct 
ggtgctggtg caactggtgc tggtgctttt 
gaggctaccg gttctgaagc cggtagtgat 



agaaagggcg ggaatagttc ccgtcccgtg 60 
gggccaaagc aagcagtaag atatgttgct 120 
agatttataa tcaaccctga aggcacgaac 180 
aagtgggacc ggttaatatc ccccttgaag 240 
tacgcgaacg cactgccgaa cgacgacgag 300 
cgtcctgttg cgttcgatcg ggatgcaatt 360 
ggagccaatg agagagacac ttaccaccaa 420 
atttctactg ccctgctttt ggaagggaaa 480 
atgagatacc atagggagga catgattccc 540 
acaaacatca aacccaagtc tcacacttct 600 
cacatcatcc tcacgaatat ccagattgat 660 
tccgtgattg aaagcgggct aaagtcgggg 720 
tgtctaatca tggctttgtg ccaacaagcg 780 
aggatcccgc cggccattga tgaccgatac 840 
agaagtagtt cagctgctga ggttaccggg 900 
ggatccgatc ctttccagca ggctgtctgc 960 
cagcgcgtca tgctcgatat gcacgattct 1020 
ccctccggtg agpattctat gatgtcacgt 1080 
gtggacaggc ctgtgtttgg agagggggcg 1140 
tctggtgctg ctgatgatga tgatgatgat 1200 
gagggttatg agtccttgga gggc 1254 



<210> 26 
<211> 564 
<212> DNA 

<213> Arabidopsis thaliana 



<400> 26 

tgt gat teat gecagagaaa aggcaacatc aatagaagaa atgagatgee tcagaatcca 60 
atcttggaag ttgagatctt tgatgtatgg gggattgatt ttatgggtcc attcccatct 120 
teataeggta ataaatatat actggtcgcc gtagactacg tatcaaagtg ggtcgaagct 180 
attgetagtc ctaccaacga tgcaaaagtt gtgctgaagt tgttcaaaac cataatcttc 240 
ccaagatttg gagttcccag ggtagtaatc agtgatggcg gaaagcattt catcaacaag 300 
gtttttgaga acctcttgaa gaagcatggg gtaaagcagg ttgagatctc caatagggag 360 
ataaaaacaa ttctggaaaa gactgttggg attacaagga aagactggtc tgcaaagcta 420 
gatgatgeat tatgggctta caggacagct ttcaagaccc ccataggtac aactcctttc 480 
aatcttctct atggaaaatt atgtcatcta cccgttgagc tcgagtacaa agcaatgtgg 540 
geggtaaaac ttctgaactt tgac 564 



<210> 27 
<211> 600 
<212> DNA 

<213> Arabidopsis thaliana 



<400> 27 

ettgatgetg gtgtcatcta ccctatctct gatagtactt gggtttctcc agtgcattgc 60 
gtccctaaaa agggtggaat gactgttgtc aaaaatgaaa aagatgaact gatccctact 120 
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agaactataa ctggtcatag aatgtgcata 
aaagatcatt ttcctttacc attcattgac 
tattattgct ttcttgatgg atacagtggt 
caagaaaaaa ccactttcac gtgtccttat 
ggtttatgca atgctcctgc aacatttcag 
atcgaggaga tggtggaggt tttcatggac 
tcatgtttgt tgaatcttgg cagggtattg 
aattgggaaa agtgtcattt catggtgaag 



gattatagga agttgaacgc tgcatctagg 180 
caaatgcttg aacgtttggc taatcatcca 240 
ttctttcaaa taccaattca ccctaatgat 300 
ggaacttttg cctataaaag aatgccattt 360 
aggtgtatga cctctatatt ttcagactta 420 
gatttttcgg tctatggccc ctctttctcc 480 
actaggtgcg aagagacgaa tcttgttctc 540 
gaaggcatag tattggacca caagatatca 600 



<210> 28 
<211> 192 
<212> DNA 

<213> Arabidopsis thaliana 



<400> 28 

tttgaaatca tgtgtgatgc atcagattac 
gacaagaagc ttcatgtcat atattacgcc 
tatgcaacaa ctgagaagga gcttctagct 
tatttggttg ga 



gcagtaggag ctgttctagg ccagaaaata 60 
agccgaacgt tggatgacgc tcagggaaga 120 
gttgtattcg catttgagaa gttcagaagc 180 

192 



<210> 29 
<211> 597 
<212> DNA 

<213> Pisum sativum 



<400> 29 

ttggatgcga gaatgattta cccgatctcg 
gttccgaaga aaggtggaaa taccgtcatc 
aaagttgcaa cggggtggag aatgtgtatt 
aaggaccatt ttccactccc gttcatggat 
tactattgtt tcttggatgg ctattccggg 
cattaaaaga cggctttcac atgtccgttt 
gggttgtgca atgcaccgac gactttccaa 
aatgagaaaa caatggaagt cttcatggat 
ttatgcttgg caaacttgaa aacggtgctt 
aattggtaga agtgccactt catggtgacc 

<210> 30 
<211> 192 
<212> DNA 

<213> Pisum sativum 



gatagtccat gggtcagtcc cgtgcatgtg 60 
cggaatgaca aggatgaatt gatccctacc 120 
gaatataggc ggttgaatac cgcaactcga 180 
caaatgctgg aaagactctc cgggcaacaa 240 
tataaccaaa ttgccgttga cccggccgat 300 
ggagtgttcg cataccgaaa aatgtccttt 360 
cgatgtgtgc aagccatttt tgccgacctt 420 
gacttctcgg tatttggtgt atcctttagt 480 
gaaagatgtg tgaagaccaa tcttgtgctt 540 
gaggggatag tgcttggcca taaagtc 597 



<400> 30 

tttgagctaa tgtgtgatgc gagcaactat 
gagaaaaaat ttcatgcgat acattacgca 
tatgccacca ctgaaaaaga attacttgcg 
tatcttatag gg 



gcaatcggag cggtattagg ccaaagaaaa 60 
agtaaagttc ttaatgaggc tcaaattaac 120 
atagtgtatg cacttgaaaa gtttaggtct 180 

192 



<210> 31 
<211> 581 
<212> DNA 
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<213> Pisum sativum 



<400> 31 

tgtgatagtt gccagagaag cggtgggatt 
atccaagagg tcgaagtatt tgattgttgg 
cttatggtaa cgagtatatg cttgtcgcag 
cgaaaacggt aataattttt ttgaagaaaa 
tgttgataag tgacggaggg tcacactttt 
attacggtgt atcacacaga gtggcaactc 
aggtctctaa tcgtgagatt aagagaattc 
agtggtcaca aaaattggat gaagcgttat 
ttgggctcac tccttttcaa ttggtgtttg 
agcacaaagc cttgtgggct ttgaaaatta 

<210> 32 

<211> 1362 

<212> DNA 

<213> Glycine max 



ggtaagagag acgagatgtc tctccaaaac 60 
ggcatcgatt ttgtaggacc attcccccct 120 
ttgaggcgat tgcctcacct cgggcggatg 180 
acatattttc ccgtttcgga accccccgag 240 
gtaatgcacc gttggaaagc attttaaaac 300 
cgtatcaccc acaggctaat ggacaagccg 360 
tcgaaaaaac tgtgtcaaat tcgaaaaaag 420 
gggcataccg taccgccttt aaagctccaa 480 
gtaaaacttg ccatttgccg gtcgaattgg 540 
ataattttga a 581 



<400> 32 

atggcctcct gtaaacaccg agctgtgccc acacccgggg aagcgtccaa ctgggactct 60 
tcacgtttca ctttcgagat tgcttggcac agataccagg atagcattca gctccggaac 120 
atccttccag agaggaatgt agagcttgga ccagggatgt ttgatgagtt cctgcaggaa 180 
ctccagaggc tcagatggga ccaggttctg acccgacttc cagagaagtg gattgatgtt 240 
gctctggtga aggagtttta ctccaaccta tatgatccag aggaccacag tccgaagttt 300 
tggagtgttc gaggacaggt tgtgagattt gatgctgaga cgattaatga tttcctcgac 360 
accccggtca tcttggcaga gggagaggat tatccagcct actctcagta cctcagcact 420 
cctccagacc atgatgccat cctttccgct ctgtgtactc cagggggacg atttgttctg 480 
aatgttgata gtgccccctg gaagctgctg cggaaggatc tgatgacgct cgcgcagaca 540 
tggagtgtgc tctcttattt taaccttgca ctgacttttc acacttctga tattaatgtt 600 
gacagggccc gactcaatta tggcttggtg atgaagatgg acctggacgt gggcagcctc 660 
atttctcttt agatcagtca gatcgcccag tccatcactt ccaggcttgg gttcccagcg 720 
ttgatcacaa cactgtgtga gattcagggg gttgtctctg ataccctgat ttttgagtca 780 
ctcagtcctg tgatcaacct tgcctacatt aagaagaact gctggaaccc tgccgataca 840 
tctatcacat ttcaggggac ccgccgcacg cgcaccagag cttcggcgtc ggcatctgag 900 
gctcctcttc catcccagca tccttctcag cctttttccc agtgaccacg gcctccactt 960 
ctatccacct cagcacctcc atacatgcat ggacagatgc tcaggtccfct gtaccagggt 1020 
cagcagatca tcattcagaa cctgtatcga ttgtccctac atttgcagat ggatctgcca 1080 
ctcatgactc cggaggccta tcgtcagcag gtcgcctagc taggagacca gccctccact 1140 
gacagggggg aagagccttc tggagccgct gctactgagg atcctgccgt tgatgaagac 1200 
ctcatagctg acttggctgg cgctgattgg agcccatggg cagacttggg cagaggcagc 1260 
tgatcttatg ctttaatgtt ttcttttata ttatgtttgt gttctctttt atgttttatg 1320 
ttatgttttt atgtagtctg tttggtaatt aaaaagaggt ag 13 62 

<210> 33 

<211> 192 

<212> DNA 

<213> Glycine max 



<400> 33 

tttgagttga tgtgtgacgc gagcgattat gctataggtg cagtgcttgg acaaaggaag 60 
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ggcaaaattt ttcatgctat ctactacgcc agcaaagttt taaatgatgc acaggttaac 120 

tatgctacca cagaaaaaga aatgttggca attgtttatg cacttgaaaa gttcaaatct 180 
tatttggtag gc 192 



<210> 34 

<211> 597 

<212> DNA 

<213> Glycine max 



<400> 34 

ttggaggttg ggctcatata ccccatctct 
gttcccaaga aaggtggaat gacagtggta 
cgaacagtca ctggctggcg aatgtgtatt 
aaggaccatt tccccttacc tttcatggat 
tactactgtt tcttggatgg atactcggga 
caggagaaga cggtctttac atgccccttt 
gggttatgta atgtaccagc cacatttcag 
gtggagaaaa gcatcgaggt atttatggac 
agctgtttga ggaacctaga aatggtactt 
aattgggaaa agtgtcattt tatggttcga 

<210> 35 

<211> 603 

<212> DNA 

<213> Glycine max 



gacaacgctt gggtaagccc agtacaggtg 60 
caaaatgaga ggaatgactt gataccaaca 120 
gactatcaca agctgaatga agctacacgg 180 
cagatgctgg agagacttgc agggcaggca 240 
tacaaccaga tcgcggtaga ccccatagat 3 00 
ggcgtctttg cttacagaag gatgtcattc 360 
aggtgcatgc tgaccatttt ttcagacatg 420 
gacttctcgg tttttggacc ctcatttgac 480 
cagaggtgcg tagagactaa cttggtactg 540 
gagggcatag tcctaggcca caagatc 597 



<400> 35 

tgtgataaat gtcagagaac aagggggata 
atcatggagg tagagatctt tgatagttgg 
tcatacagga atgtctacat cttggtagct 
atagccacgc tgaaggacga tgccagggta 
tcccatttcg gagtcccacg agccttgatt 
cagfctgaaga aagtcctgga gcactataat 
actcagacga atggccaagc agaaatttct 
acagttgcat catcaagaaa ggattgggcc 
aggacagcgt tcaagactcc catcggctta 
tgtcatttac cagtagagct ggagcacaag 
gac 



tctcgaagaa atgagatgcc tttgcagaat 60 
ggcatagact tcatggggcc tcttccttca 120 
gtggattacg tctccaaatg ggtggaagcc 180 
gtgatcaaat ttctgaagaa gaacattttt 240 
agtgatgggg gaacgcactt ctgcaacaat 300 
gtccgacaca aggtggccac accttatcac 3 60 
aacagggagc tcaagcgaat cctggaaaag 420 
ttgaagctcg atgatactct ctgggcctat 480 
tcaccatttc agctagtata tgggaaggca 540 
gcatattggg ctctcaagtt gctcaacttt 600 

603 



<210> 36 

<211> 150 

<212> DNA 

<213> Glycine max 



<400> 36 

cctaaaatac tacaacgaca tgattggtgt 
caatttggcg ccgttgccaa ttgggtgttt 
tagatcaagt tctttttcaa ttttcttttt 



tttaggataa ttgactgaaa aacctattat 60 
gtttgttaca tttgagattt cagacttgct 120 

150 



<210> 37 
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<211> 11 

<212> DNA 

<213> Glycine max 

<400> 37 
tggcgccgtt g 

<210> 38 

<211> 15 

<212> DMA 

<213> Glycine max 

<400> 38 

tggcgccgtt gccgg 

<210> 39 

<211> 27 

<212> DNA 

<213> Glycine max 

<400> 39 

tttttggcgc cgttgtcggg gattttg 

<210> 40 

<211> 9 

<212> DNA 

<213> Glycine max 

<400> 40 
tttggggga 

<210> 41 

<211> 16 

<212> DNA 

<213> Glycine max 

<400> 41 

tttaatttgg gggatt 



